DOCUMENT RESUME



ED 224 689                                        SE 039 718

AUTHOR          Ross, Steven M.
TITLE           Introductory Statistics. A Conceptual Approach.
                Modified Preliminary Edition.
REPORT NO       ISBN-0-8134-2248-5
PUB DATE        82
NOTE            403p.
AVAILABLE FROM  Interstate Printers and Publishers, Inc., 19-27 North
                Jackson Street, Danville, IL 61832 ($14.50, less
                educational discounts).
PUB TYPE        Guides - Classroom Use - Materials (For Learner)
                (051)
EDRS PRICE      MF01 Plus Postage. PC Not Available from EDRS.
DESCRIPTORS     *College Mathematics; Higher Education;
                *Individualized Instruction; Instructional Materials;
                Mathematical Applications; *Mathematical Concepts;
                *Mathematics Instruction; *Statistics; Supplementary
                Reading Materials; Textbooks

ABSTRACT 

This document deals with basic statistical concepts
and operations used in the social sciences. The book was written
under the philosophy that students enter statistics courses with a
variety of aptitudes and experiences, and that traditional teaching
approaches can do little to accommodate individual needs. It sets up
an individualized course, and is designed to support teaching
strategies where pupils do much of the learning on their own. The
material is divided into 11 units, and the first part of each unit
informs pupils of the general content area and how it relates to
other topics. Each unit lists specific learning objectives, and
provides instruction which includes a practice test for each stated
objective. A major characteristic of the text is its emphasis on
concepts rather than computations, with greater stress given to the
why of procedures than is thought to be usual in most books. The
presentation also attempts to reduce pupil anxiety often associated
with statistics. This is done through an informal writing style that
intersperses anecdotes, "conversational" prose, and humor. Students
seem to like this approach, and have indicated that it seems to make
reading easier and helps concepts stand out in memory. (MP)



*********************************************************************** 

*    Reproductions supplied by EDRS are the best that can be made    *
*                   from the original document.                      *
*********************************************************************** 



U.S. DEPARTMENT OF EDUCATION
NATIONAL INSTITUTE OF EDUCATION

EDUCATIONAL RESOURCES INFORMATION
CENTER (ERIC)
This document has been reproduced as
received from the person or organization
originating it.
Minor changes have been made to improve
reproduction quality.

Points of view or opinions stated in this
document do not necessarily represent official NIE
position or policy.



INTRODUCTORY
STATISTICS



"PERMISSION TO REPRODUCE THIS
MATERIAL IN MICROFICHE ONLY
HAS BEEN GRANTED BY



TO THE EDUCATIONAL RESOURCES
INFORMATION CENTER (ERIC)."






INTRODUCTORY 
STATISTICS 



Modified Preliminary Edition 






INTRODUCTORY 
STATISTICS 



A Conceptual Approach



Steven M. Ross, Ph.D. 

PROFESSOR 

DEPARTMENT OF FOUNDATIONS 

OF EDUCATION 
MEMPHIS STATE UNIVERSITY




Order from 



THE INTERSTATE
Printers & Publishers, Inc.

Danville, Illinois 61832-0694



INTRODUCTORY STATISTICS:

A Conceptual Approach



Copyright © 1982, 1979, by The Interstate Printers & Publishers,
Inc. All rights reserved.



Preface

Statistics is hardly the most popular subject one can find in college
curriculums; in fact, it scares many students and produces in them many
common avoidance and fear responses. Students who are required to take a
statistics course often put it off until their last semester, hoping, perhaps,
that the program will change, that the statistics professor will retire and
not be replaced, or that somehow they will succeed in slipping by with the
statistics requirement incomplete.

Such negative attitudes, however, seem less the fault of students than
of statistics professors and the textbooks they employ. The traditional
approach to teaching statistics suggests an image of an old professor
(perhaps wearing a black robe and cap) haphazardly filling the blackboard with
an uninterpretable array of numbers, equations, and strange symbols. He
assumes that students will find themselves drooling with enthusiasm each
time a new symbol or equation makes its way into their notebooks. Worst
of all, the old professor ignores all the material he has presented on
standardizing test scores and constructing "fair" grade distributions, with
the result being a preponderance of F's and D's awarded as final grades.
(Some professors anyway...)

On a more serious level, it is fairly easy to find faults with the
lecture approach to teaching statistics, when it is the only strategy used.
Lectures can be helpful, but it is difficult to construct them in such a
way that they will accommodate the needs of most students in attendance.
The bright student may find the lecture too elementary; he understands the
material, but questions the purpose of hearing it presented again, and finds
it much more adaptive to "doze off." The "slow" student may want to learn,
but since he does not have full command of all required skills, finds the
lecture almost impossible to understand; he needs more time, more rehearsals,
a slower, more systematic presentation, but obviously the instructor is
limited in what he can do to accommodate these individual needs.

The present textbook is intended to support teaching strategies where
students can do much (or all) of their learning on their own, outside of
class. An earlier version was developed and employed specifically for a
course taught by the Keller Plan or Personalized System of Instruction (PSI).
This version maintains the same spirit, although there is no reason why it
could not be employed with virtually any teaching method. To support student
self-learning, the following structure is used. The first part of each unit
informs students of the general content area and how that content relates to
the book's overall coverage. This overview is then followed by a listing of
the specific performance objectives for the unit. The instructional portion
of the unit then follows, which includes a "lesson" and a "practice posttest"
for each of the stated objectives. Following the last unit posttest, a
comprehensive "review test" is presented, which covers material relating to all
objectives covered. If students are satisfied with their practice test
performances they can move to the next major unit. If not satisfied, they can
identify the specific objective(s) on which they are weak and then repeat
the review and testing activities in the selected area(s). Aside from this
structure, two other characteristics distinguish this book from most
others being sold. One characteristic is that there is more textual
elaboration and explanation than is usual; this is to allow the narrative
to "take the place" of the lecturer (should a student-managed
teaching approach, without conventional classes, be desired). However,
should the book be used along with lecture classes, the increased
explanation can only serve to reinforce what is said in class and to provide
additional perspectives on the material. So many books seem long on
mathematics, tables, and formulas, but short on textual discussion! This
one is intended to be different.

A second characteristic is the use of an "informal" writing style,
interspersed with anecdotes, conversational-type prose, and "attempted"
humor. Thus, along the way, the reader will encounter such characters as
"Bubbles the Whale," whose weighing is used to illustrate the concept of
"real limits"; "Murray Binet," whose questionable attempt to develop a
new IQ test is used to illustrate the need for caution in interpreting
standard scores; and "Natasha Bobolinckskismith," for whom the agony of
defeat in a gymnastics competition was used to illustrate something or
other. It is hoped that these will not be offensive, but rather provide
contexts that stand out in your mind and help you to understand and later
mentally "reconstruct" the statistical concepts they convey.

Well, enough (perhaps too much) has been said by way of introduction.
But before you start plunging into the material to follow, there are three
suggestions that may be worthwhile to keep in mind. Here they are:

1. Hand calculators have become very popular (and inexpensive) in
recent years. If you have been entertaining the thought of buying one,
now would be a very good time to make the purchase. It will enable you
to work the problems rapidly and accurately. If you cannot afford one,
there is no need to worry since you can get through doing the problems by
hand. But a calculator would be very advantageous, mostly in terms of
saving time!

2. In solving problems, the general rule is to carry out your work
to at least two (2) place decimal accuracy. Sometimes exceptions will
be in order; use your judgment.

3. Do all practice problems. They are there for a purpose and you
should take advantage of them.



Acknowledgements



The present textbook owes its style, structure, and much of its
content to an earlier edition which, in turn, was made possible
through helpful assistance of several individuals. First, the author
owes a considerable debt of gratitude to Dr. Dennis Roberts, whose
successful course at The Pennsylvania State University provided the
idea and a procedural framework for the present approach.

The writing of this text was sponsored, in part, by a Seed Grant
from the Center for Instructional Service and Research, Memphis State
University. The author also wishes to thank Dr. Andrew Bush and Steve
Baucum, who made very valuable contributions through their editing and
critiquing of parts of the manuscript. Denise Oldham and June Powers
provided assistance in proofreading the final version.

Very special thanks are due to Mrs. Sue Douglass, who handled the
task of typing the manuscript largely through successfully deciphering
the almost illegible scribblings by the author. Mrs. Douglass took a
special interest in the project, assuming responsibilities and schedules
that went far beyond the call of duty. Writing the book was
substantially easier with her support.



The author is grateful to the Literary Executor of the late Sir
Ronald A. Fisher, F.R.S., to Dr. Frank Yates, F.R.S., and to Longman
Group Ltd., London, for permission to reprint portions of Tables III
and IV from their book, Statistical Tables for Biological, Agricultural
and Medical Research (6th Edition, 1974).



Table of Contents

PREFACE

ACKNOWLEDGEMENTS

UNIT A: REVIEW OF BASIC MATH

    A.1   Addition
    A.2   Subtraction
    A.3   Multiplication
    A.4   Division
    A.5   Order of Operations
    A.6   Fractions
    A.7   Proportions and Percentages
    A.8   Solving for Unknowns
    A.9   Exponents and Roots
    A.10  Summation
    Unit A Review Test

UNIT I: ORGANIZATION AND PRESENTATION OF DATA

    1.1    Real Limits of Scores
           Posttest
    1.2.1  Ungrouped Frequency Distributions
           Posttest
    1.2.2  Grouped Frequency Distributions
           Posttest
    1.3.1  Frequency Polygons
           Posttest

    1.3.2  Histograms
           Posttest
    1.3.3  Cumulative Frequency Graphs
           Posttest
    1.4    Forms of Frequency Distributions
           Posttest
    Unit I Review Test

UNIT II: PERCENTILE RANKS AND CENTRAL TENDENCY

    2.1    Percentile Ranks
           Posttest
    2.2    Computing Percentile Ranks
           Posttest
    2.3    Converting Percentile Ranks into Raw Scores
           Posttest
    2.4    Central Tendency
           Posttest
    2.4.1  Computation of Means
           Posttest
    2.4.2  Computation of the Median
           Posttest
    2.4.3  Determination of the Mode
           Posttest
    2.5    Comparisons Between Means, Medians, and Modes
           in Skewed and Non-Skewed Distributions
           Posttest
    Unit II Review Test



UNIT III: VARIABILITY

    3.1    Variability
           Posttest
    3.2    The Range
           Posttest
    3.3    The Variance
           Posttest
    3.4    The Standard Deviation
           Posttest
    Unit III Review Test

UNIT IV: POSITION MEASURES AND THE NORMAL CURVE

    4.1    Z Scores
           Posttest
    4.2    Converting Z Scores into Raw Score Equivalents
           Posttest
    4.3    Converting Z Scores into T Scores
           Posttest
    4.4    Normal Curve
           Posttest
    4.5    Raw Score and Z Score Scales in Normal
           Distribution
           Posttest
    4.6    Percentage of Scores Between Standard
           Deviations
           Posttest
    4.7    Finding Percentage of Scores Between Any Two
           Z's by Use of "Normal Curve" Table
           Posttest



    4.8    Finding Area Between Any Two Raw Scores in a
           Normal Distribution
           Posttest
    4.9    Converting Z Scores into Percentile Ranks,
           and Percentile Ranks into Z Scores
           Posttest
    Unit IV Review Test

UNIT V: CORRELATION

    5.1    Definition of Correlation
           Posttest
    5.2    Conducting a Correlational Study: Data
           Collection
           Posttest
    5.3    Construction and Interpretation of
           Scatterplots
           Posttest
    5.4    The Correlation Coefficient
           Posttest
    5.5    Computation of r
           Posttest
    Unit V Review Test

UNIT VI: PROBABILITY

    6.1    Probability: What Is It?
           Posttest
    6.2    Computing Probabilities on a Single Trial
           Posttest
    6.3    Computing Probabilities over Multiple Trials
           Posttest




    Unit VI Review Test

UNIT VII: MAKING INFERENCES FROM SAMPLES

    7.1    Random Samples
           Posttest
    7.2    The Relationship Between Parameters and
           Statistics
           Posttest
    7.3    The Notion of Bias in Statistical Inference
           Posttest
    7.4    Sampling Distributions of Means
           Posttest
    7.5    Computing Probabilities for Obtaining
           Sample Means
           Posttest
    Unit VII Review Test

UNIT VIII: HYPOTHESIS TESTING

    8.1    Rationale and Basic Procedures
           Posttest
    8.2    The Null and Alternative Hypotheses
           Posttest
    8.3.1  Type I and Type II Errors
    8.3.2  Probability Level
           Posttest
    8.4    Testing Hypotheses When Population Parameters
           Are Known: the Z Test
           Posttest
    8.5    Testing Hypotheses When Population Parameters
           Are Unknown: the t Test
    Unit VIII Review Test

UNIT IX: SELECTED INFERENTIAL TECHNIQUES

    9.1    Testing the Differences Between Two Means
    9.1.1  The t Test Using Independent Samples
    9.1.2  The t Test Using Dependent Samples
    9.2    Testing the Significance of the Correlation
    9.3    Using Confidence Intervals for Parameter ...

UNIT X:

    10.1   Nominal Scores and Their Analysis Through ...
    10.2
    10.2.1
    10.2.2
    10.3   Using Chi-Square in a Two-Way Test

UNIT XI: INTRODUCTION TO ANALYSIS OF VARIANCE

    11.1   Using Analysis of Variance in Research







    11.2   Understanding General Procedures for
           Computing ANOVA
           Posttest
    11.3   Interpreting ANOVA Results
           Posttest
    Unit XI Review Test

ANSWER SECTION

APPENDIX

    Table I:   Normal Curve Area
    Table II:  Significant Levels of the t Ratio
    Table III: Significant Levels of the Chi-Square Test

INDEX



UNIT A 
REVIEW OF BASIC MATH 



Prerequisite Skills 

When you turn the page, you will find what is called a "review"
unit on basic arithmetic and algebraic operations. Although
the present course attempts to emphasize mathematical skills
as little as possible, there are some rather basic operations
which you will need to perform as you move through the "eleven
units." All of these basic operations are reviewed in the
following section, and it is strongly recommended that you read
through these and attempt to diagnose any weaknesses that you
might have.



Procedure for Review

Proceed through the review unit, perhaps spending from 30
minutes to one hour on the entire section. Once you feel
reasonably comfortable with the various operations covered,
take the review test which appears at the end of the unit.
Then check your answers in the answer section. If you score
90% or above, math will probably not be a problem for you, but
still go over any questions that you missed, and try to identify
what the errors involved. If, however, you score less than 80%,
you definitely need review: reread the review unit and think
about stopping in for help. Do not begin working on Unit I
until you have completely mastered the various math and algebraic
operations that are covered in the practice test; if you begin
before mastery is achieved, you will only be making it more
difficult for yourself later on in the course.



Begin now... and good luck!



A.1 Addition



Rule A: When positive numbers are added together, the
answer has a positive sign.

Examples:      15          13X
           (+)  3      (+)  6X
               18          19X

Rule B: When negative numbers are added together, the
answer has a negative sign.

Example:       - 1
           (+) -15
               -16

Rule C: When positive and negative numbers are added
together, the answer has the sign of the larger
quantity.

Examples:      13          -27
           (+) -12     (+)  15
               + 1         -12

While the above rule (Rule C) for determining the proper sign is
easy, a more basic problem for many students is performing
the math operations. Here is a rule for that: (even though
we are dealing with an addition problem) subtract the
smaller number (whether it is the positive or the negative
one) from the larger number. The result is your answer;
all that remains is to provide the sign.

Example:       -57
           (+)  32
               -25

Why? The first rule states that you must use the sign
of the larger number; thus, the answer would be
"negative." The second rule states that you should
subtract the smaller from the larger (forgetting for
a moment about signs); thus, the answer would be "25."
Putting both rules together gives the final result, "-25."
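
The three addition rules above can be sketched in a few lines of Python. This is only an illustration of the sign-then-magnitude reasoning; the function name `add_signed` is a hypothetical helper, not something the text defines.

```python
def add_signed(a, b):
    """Add two signed numbers by following Rules A, B, and C (hypothetical helper)."""
    if (a >= 0) == (b >= 0):
        # Rules A and B: same sign, so add the magnitudes and keep the shared sign.
        magnitude = abs(a) + abs(b)
        sign = 1 if a >= 0 else -1
    else:
        # Rule C: subtract the smaller magnitude from the larger one,
        # then attach the sign of the larger quantity.
        larger, smaller = (a, b) if abs(a) >= abs(b) else (b, a)
        magnitude = abs(larger) - abs(smaller)
        sign = 1 if larger >= 0 else -1
    return sign * magnitude


print(add_signed(15, 3))    # Rule A
print(add_signed(-1, -15))  # Rule B
print(add_signed(-57, 32))  # Rule C
```

In practice Python's `+` already does all of this; the helper only spells out the two decisions (which sign, which magnitude) that the rules ask you to make by hand.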



A.2 Subtraction

Subtraction, for some people, is much more troublesome than
addition, particularly when negative numbers are involved.
If this applies to you, it will no longer, provided that
you can learn the following rule:

Rule: Given the task of subtracting two numbers, reverse
the sign of the number to be subtracted (i.e., the
subtrahend) and treat the problem as if it were one of addition.

Example:        5
           (-)  3
                2

(Note: this one probably seems very easy, but the important
task for now is to learn the rule so that you will be able
to apply it to more difficult examples.)

Why? Using the above rule, the subtrahend, 3, is converted
to a "-3", and the problem becomes one of addition:
5 (+) -3 = 2 (remember addition rules given above).



Example:       -10
           (-) +12
                ?

For this one, apply the rule in the same manner: (a) change
the sign of the subtrahend from a +12 to a -12; (b) then treat
the problem as one of addition. -10 (+) -12 = ?

Ans. = -22

(If you do not understand why, check Rule "B" under "Addition.")

Example:       +15
           (-) -22
                ?

Change the sign of the subtrahend and add: +15 (+) +22 = ?

Ans. = +37

Example:       - 8
           (-) - 6
                ?

Change the sign of the subtrahend and add: -8 (+) +6 = ?

Ans. = -2 (see Rule "C" under "Addition" if you do not see how
the answer was derived.)
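
As a quick check of the subtraction rule, here is a minimal Python sketch. The name `subtract` is an illustrative assumption; the body does exactly what the rule says: reverse the sign of the subtrahend, then add.

```python
def subtract(minuend, subtrahend):
    """Subtract by reversing the subtrahend's sign and adding (hypothetical helper)."""
    reversed_subtrahend = -subtrahend   # step (a): change the sign
    return minuend + reversed_subtrahend  # step (b): treat as addition


print(subtract(-10, 12))
print(subtract(15, -22))
print(subtract(-8, -6))
```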






A.3 Multiplication 

Although multiplication can be handled easily by a calculator
(or by hand if you are careful with calculations), what gives
some people problems is the question of what sign to use in
the answer. Here are two simple rules for determining signs,
with some examples which show how they are applied.

Rule A: If the two numbers being multiplied have the same
sign, whether "+" or "-," the answer will have a "+" sign.

Rule B: If the two numbers being multiplied have opposite
signs, the answer will have a "-" sign.



Example:        23
           (x)   4
                 ?

Since the two numbers have the same sign, the answer will be
positive. Do your multiplication and attach a "+" sign to
your answer.

Ans. = +92

Example:       -57
           (x) - 4
                 ?

Since the two numbers have the same sign, do your
multiplication and attach a positive sign to your answer.

Ans. = +228

Example:       -36
           (x)  28
                 ?

Since the two numbers have opposite signs, do your
multiplication and attach a negative ("-") sign to your
answer.

Ans. = -1008

Example: -27 (x) -3 (x) 4 (x) -10 = ?

There are a number of different systems people use for
multiplying a string of numbers of the kind shown. Perhaps
the easiest way to stay out of trouble is to multiply only
two numbers at a time while applying the above rules to
determine the sign of each successive answer. This is how
it would be done:

Taking the first two numbers, -27 (x) -3, we get +81 (signs
were the same).

Then multiply the +81 by the next number in the string:
+81 (x) +4 = +324 (signs were the same).

Then multiply the +324 by the final number, -10, to yield
+324 (x) -10 = ?

Ans. = -3240



A.4 Division


Like multiplication, division can be easily handled by a
calculator, but there is sometimes a problem in determining
the appropriate sign of the answer. Fortunately, the rules
you just learned for multiplication apply to division in the
identical manner.

Rule A: If the two numbers have the same sign ("+" or "-"),
the answer is "+."

Examples: 9 (÷) 3 = 3; -12 (÷) -4 = 3

Rule B: If the signs of the two numbers are opposite, the
answer is "-."

Examples: 15 (÷) -5 = -3; -21 (÷) 3 = -7
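
The sign rules shared by multiplication and division can be sketched as follows. Both function names are hypothetical, and the sketch assumes nonzero inputs, since zero has no sign to compare.

```python
def result_sign(a, b):
    """Return '+' when a and b share a sign, '-' when their signs differ."""
    return "+" if (a < 0) == (b < 0) else "-"


def multiply_string(numbers):
    """Multiply a string of numbers two at a time, fixing the sign at each step."""
    result = numbers[0]
    for n in numbers[1:]:
        sign = result_sign(result, n)          # Rule A or Rule B
        magnitude = abs(result) * abs(n)       # multiply ignoring signs
        result = magnitude if sign == "+" else -magnitude
    return result


print(result_sign(-57, -4))               # same signs
print(result_sign(15, -5))                # opposite signs (applies to division too)
print(multiply_string([-27, -3, 4, -10]))
```

Working two numbers at a time mirrors the worked example: the intermediate results are +81, then +324, then the final negative product.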

In case you are to do division by hand, here are two more
rules to keep in mind.

Rule C: Remember to divide variable labels, such as "X" or
"Z," as part of the overall problem:

    X (÷) X = 1
    Z² (÷) Z = Z     (Note that as a check, Z x Z = Z²)
    Z⁴ (÷) Z² = Z²   (Note that Z² x Z² = Z⁴)

Examples: 15X (÷) 3X = ?

          Ans. = 5 (X's cancel out)

          39Y³ (÷) -3Y = ?

          Ans. = -13Y² (note that answer is negative
                 since signs were opposite;
                 also Y³ (÷) Y becomes Y²)

If you're having trouble with problems involving powers
(exponents), remember that exponents are added during multiplication
and subtracted during division. Thus, a number to the second
power times a number to the fifth power yields an answer expressed
to the seventh power (2 + 5); or, a number to the eighth power
divided by one to the third power yields an answer expressed to
the fifth power (8 - 3).
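
The exponent rule in the paragraph above can be checked numerically. The sketch below uses an arbitrary base of 3 and two hypothetical helper names; it assumes both powers share the same base, which is what the rule requires.

```python
def multiply_powers(m, n):
    """Exponent of (x to the m) times (x to the n): exponents are added."""
    return m + n


def divide_powers(m, n):
    """Exponent of (x to the m) divided by (x to the n): exponents are subtracted."""
    return m - n


x = 3  # any nonzero base serves for this check
print(x**2 * x**5 == x**multiply_powers(2, 5))
print(x**8 // x**3 == x**divide_powers(8, 3))
```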

Rule D: In dividing numbers involving decimals (without a
calculator), make the divisor a whole number by moving its
decimal an appropriate number of places to the right; then
do the same operation for the dividend by moving its decimal
and/or adding zeroes to equal the total places the divisor's
decimal was moved.

Example: 15 (the dividend) ÷ .5 (the divisor) = ?

         First, make the divisor a whole number by moving its
         decimal one place to the right.

         150 ÷ 5

         The dividend has no decimal, so a zero needs to be
         added, as shown.

         Ans. = 30







Example: 100.2 ÷ .505 = ?

         Here, a three-place movement is needed in the divisor.
         We must do the same for the dividend: moving its decimal
         gives 1 place; adding 2 zeroes gives the total, as shown.

         100200. ÷ 505 = ?

         Ans. = 198.42 (rounded to 2 decimal places)

Actually, such problems should be set up in long division form.
The last example will illustrate this procedure.

Example: -13.1 ÷ 5.96 = ?

    (a) Set up as:   5.96 / -13.1

    (b) Move decimal two places in the divisor. Equalize
        by moving the dividend's decimal the one
        possible place and adding 1 zero.

                     596 / -1310

    (c) Answer will have a negative sign because the
        two numbers are mixed. To avoid confusion,
        put proper sign in the answer space and
        remove the "-" from the dividend.

                          -
                     596 / 1310

    (d) Now work the problem.

                          - 2.197
                     596 / 1310.000
                           1192
                            1180
                             596
                             5840
                             5364
                              4760
                              4172

        Note: We have added three decimal places to
        the dividend so that our answer will have
        two-place accuracy.

        Ans. = -2.20
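
Rule D can be sketched with Python's standard `decimal` module, which keeps track of how many decimal places a number has. The function name `divide_by_shifting` and its `places` parameter are assumptions made for this illustration; the shifting itself follows the rule as stated.

```python
from decimal import Decimal, ROUND_HALF_UP


def divide_by_shifting(dividend, divisor, places=2):
    """Divide decimals by first making the divisor a whole number (Rule D sketch).

    Pass the numbers as strings (for example ".505") so their decimal
    places are preserved exactly.
    """
    dividend, divisor = Decimal(dividend), Decimal(divisor)
    # How many places must the divisor's decimal move to the right?
    shift = max(0, -divisor.as_tuple().exponent)
    factor = Decimal(10) ** shift
    whole_divisor = divisor * factor       # e.g. .505 becomes 505
    shifted_dividend = dividend * factor   # e.g. 100.2 becomes 100200
    quotient = shifted_dividend / whole_divisor
    # Round to the requested number of decimal places.
    return quotient.quantize(Decimal(10) ** -places, rounding=ROUND_HALF_UP)


print(divide_by_shifting("15", ".5"))
print(divide_by_shifting("100.2", ".505"))
print(divide_by_shifting("-13.1", "5.96"))
```

Shifting both numbers by the same power of ten leaves the quotient unchanged, which is why the hand method works.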



A.5 Order of Operations

In performing a sequence of operations, do your calculations
in the following order of priority.

(1) Do work in parentheses

(2) Raise to powers and/or take roots

(3) Multiply and/or divide

(4) Add and/or subtract



Example: (5 - 3) x 7 = ?             First, do work in ( ).

         (2) x 7 = ?                 Then multiply.

         Ans. = 14

Example: -14 (-) 2 x -2 + √4 = ?     First, take square root.

         -14 (-) 2 x -2 + 2 = ?      Next, multiply 2 by -2.

         -14 (-) -4 + 2 = ?          Then, add/subtract from left to
                                     right. Start with -14 (-) -4.

         -10 + 2 = ?
         Ans. = -8

Example: (12 - 3) x 4² (+) -12 ÷ √16 = ?   First, do work in ( ).

         9 x 4² (+) -12 ÷ √16 = ?    Then do powers and roots.

         9 x 16 (+) -12 ÷ 4 = ?      Then multiply and/or divide
                                     from left to right;
         144 (+) -12 ÷ 4 = ?         multiplication is first here.

         144 (+) -3 = ?              Then add.

         Ans. = 141
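
Python follows the same order of priority, so the three examples can be checked directly. This sketch assumes the second example contains the square root of 4 and the third reads (12 - 3) x 4² + (-12 ÷ √16), which is the reading consistent with the answers -8 and 141.

```python
import math

# Parentheses first, then multiplication.
example_1 = (5 - 3) * 7

# Root first, then 2 x -2, then add/subtract from left to right.
example_2 = -14 - 2 * -2 + math.sqrt(4)

# Parentheses, then the power and the root, then multiply and divide, then add.
example_3 = (12 - 3) * 4**2 + -12 / math.sqrt(16)

print(example_1, example_2, example_3)
```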

A.6 Fractions

Rule A: For adding or subtracting, convert to common
denominator, then reduce to lowest denominator.






Examples:   1/2 + 1/4 = 2/4 + 1/4 = 3/4

            5/16 - 1/12 = 15/48 - 4/48 = 11/48

            2/3 + 2/4 = 8/12 + 6/12 = 14/12 = 7/6

Rule B: For multiplication, multiply numerators and
denominators together; then reduce.

Examples:   1/5 x 1/8 = (1 x 1)/(5 x 8) = 1/40

            12/3 x 2/8 x X/Y = (12 x 2 x X)/(3 x 8 x Y) = 24X/24Y = X/Y



Rule C: For dividing, invert the fraction in the divisor and
then multiply as usual.

Examples:   1/6 ÷ 2/3 = 1/6 x 3/2 = 3/12 = 1/4

            3/8 ÷ 1/4 = 3/8 x 4/1 = 12/8 = 3/2
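
The fraction rules can be checked with Python's standard `fractions` module, which finds common denominators and reduces results automatically. The variable names below are illustrative only, and the operands are read from the worked examples where they are legible.

```python
from fractions import Fraction

# Rule A: common denominator, then reduce.
difference = Fraction(5, 16) - Fraction(1, 12)
total = Fraction(2, 3) + Fraction(2, 4)

# Rule B: multiply numerators and denominators, then reduce.
product = Fraction(12, 3) * Fraction(2, 8)

# Rule C: dividing is the same as multiplying by the inverted divisor.
quotient = Fraction(3, 8) / Fraction(1, 4)
inverted = Fraction(3, 8) * Fraction(4, 1)

print(difference, total, product, quotient)
```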



A.7 Proportions and Percentages

Rule A: A proportion is the decimal equivalent of a fraction.
To convert a fraction into a proportion, simply divide its
denominator (bottom number) into its numerator (top number).

Examples:   4/5 = ?               7/9 = ?

               .80                   .777
            5 / 4.000             9 / 7.000
                4 0                   6 3
                                        70
                                        63
                                         70
                                         63

            Ans. = .80            Ans. = .78 (rounded off)

Rule B: A percentage is a proportion expressed on a scale of
100. To convert a fraction into a percentage, first change it
to a proportion (Rule A); then move the decimal point two
places to the right.

Examples: Convert the following proportions to percentages.

          .80 = ?          .7986 = ?          1.86 = ?

          Ans. = 80%       Ans. = 79.86%      Ans. = 186%
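
A minimal sketch of both conversions follows. The function names are assumptions for this illustration, and rounding to two places is a choice made here to match the worked answers.

```python
def to_proportion(numerator, denominator, places=2):
    """Rule A: divide the denominator into the numerator, then round."""
    return round(numerator / denominator, places)


def to_percentage(proportion):
    """Rule B: move the decimal point two places right (multiply by 100)."""
    return round(proportion * 100, 2)


print(to_proportion(4, 5), to_proportion(7, 9))
print(to_percentage(0.80), to_percentage(0.7986), to_percentage(1.86))
```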



A.8 Solving for Unknowns

There is no one way to simplify an equation, and thus, if
certain methods learned in previous math courses still work
for you, continue to use them. If, however, those methods
were never really well learned, you may want to try the
conceptual approach illustrated below.

Rule A: In trying to solve for an unknown expression ("X"
or "Y," etc.) in an equation, isolate the unknown by moving
all known (i.e., numerical) values to the opposite side. The
result will be: "unknown = a certain value."



Examples: For X - 5 = 8         you want to move the "5" to
                                the right side to isolate,
                                and then solve for, X.

          For 6X - 37 - 10 = 7  you want to isolate X by moving
                                the "6," "37" and "10" to the
                                right side. Once you accomplish
                                this and simplify the
                                right side, the result will be
                                "X = certain value." At that
                                point, this equation is solved!

Rule B: The equation must always be balanced: right side
must equal left. Thus, whatever operation you perform on one
side must be repeated exactly on the other.

Example: X + 5 = 10             As will be discussed below, you
                                would want to isolate X by
                                subtracting "5" from the left side.
                                But if you did only this, the
                                equation would "tip" (i.e., be
                                unbalanced). Thus, "5" needs
                                to be subtracted from the right
                                side as well.
         X + 5 - (5) = 10 - (5)

         X = 5

Rule C: The way to remove a number from X's side of the
equation is to apply it again to each side of the equation, but
in the "opposite" way from which it is originally used.

1. Thus, to remove a value being added to X's side, we
subtract that number from both sides of the equation.

Example: X + 15 = 30 - 2             First, we might as well simplify
                                     the right.
         X + 15 = 28                 We want to remove the 15, which
                                     is being added, so we subtract
                                     15's. The 15's on the left cancel
         X + 15 - (15) = 28 - (15)   out, leaving only "X =."

         X = 28 - 15
         Ans. X = 13

2. To remove a value being subtracted from X's side, we add
that number to both sides of the equation.

Example: X - 27 = 15 ÷ 3             First, simplify the right.

         X - 27 = 5                  Then remove the -27 by adding 27's.

         X - 27 + (27) = 5 + (27)    The 27's cancel out on
                                     the left.

         X = 5 + 27
         Ans. X = 32

3. To remove a value being multiplied by X's side, divide
that value into both sides of the equation.

Example: 4X = 40                     Remove the 4 by dividing each
                                     side by 4.
         4X/4 = 40/4
                                     The 4's cancel out on the left.
         X = 10

4. To remove a value being divided into X's side, multiply
that value by both sides of the equation.

Example: X/2 = [...]                 First simplify the right.

         X/2 = 10                    Remove the 2 by multiplying
                                     each side by 2.

         2 x X/2 = 10 x 2            The 2's cancel out on the left.

         X = 20



5. When more than one value needs to be removed, the order
in which you should proceed is:

(a) simplify both sides as much as possible before
    trying to remove any numbers.

    Example: 15X - 4 ÷ 2 + 14 = 10 ÷ 5 + 16

             15X - 2 + 14 = 2 + 16

             15X + 12 = 18

    Confused? See Sec. A.5 - Order of Operations for help.

(b) then start your "replacement" with the value on X's
    side that affects the "whole" expression rather than
    just one part.

    Which value is it in the example, the "15" or the "12"?

    Ans.: It is the "12" since it affects the "whole"
    15X; the "15" only affects "X".

    Thus, we remove the "12" first:

             15X + 12 - (12) = 18 - (12)

             15X = 6

(c) Having "removed" the primary value, then work
    with the remaining one.

             15X = 6;   15X/15 = 6/15;   X = 6/15



Try these examples of complex equations:

(X - 12) ÷ 5 = 10 - 2 x 2            First simplify (the right only is
                                     possible here).
(X - 12) ÷ 5 = 10 - 4

(X - 12) ÷ 5 = 6                     Which number do we remove first on
                                     the left? The "5," as it
                                     affects both X and 12; the 12 only
                                     affects X. Remove by multiplying.

(X - 12)/5 x 5 = 6 x 5               The 5's cancel out on the left.

X - 12 = 30                          Then remove "12" through addition.

X - 12 + (12) = 30 + (12)            The 12's cancel out on the left.

Ans. X = 42



Tx5^3 +1.3^2 + 2 -iWl^y ^ * . 

5T + 3 X 1 - ft . ^ ■ ' left-aide valie fOM^"*!'^* 

5Ti3 +1-8 ' „i„ goutae, aa 16 

. the iihole expreaalon. the af- 

fect4 t only; the "3" •«f«cta 
"5T" only; the "2" affaic.ta + 
3" only. thi»» remove the "I" 
through- atditraetion. ^ 

5T + 3 + 1 - (I) - 8 - CD T^* <Mnc«l 

— f — al^>llfy right. > 



5Y -h 3 Vhlch goes next? The ••2" of courae 



• 7 



as it affects the most values* 
MCiltiply to remove it. 



5Y + 3 X 2 ■» 7 X 2 -2 's cancel out on left; simplify 

2 1 right. 

5Y + 3 - 14 Which goes nextf?^ The "3" plnce.lt 

' _ affects the Whole "5Y," Subtract 

to renove It. - , 

5Y + 3 - (3) » 14 - t3) 3^s cancel out; simplify the right. 

5Y • 11 ^ Now remove the "5" thtough division. 

Y - il/5 Is the Answer CWhewl) 
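
The answers to the worked equations can be double-checked by substituting them back in. What follows is only a minimal Python sketch, assuming exact fractions from the standard library; the helper name solve_linear is ours, chosen for illustration, and it handles only the simple form aX + b = c.

```python
from fractions import Fraction

def solve_linear(a, b, c):
    """Solve a*X + b = c using the removal order described in the text."""
    # First remove b, which affects the whole a*X term: subtract it from both sides.
    right = Fraction(c) - Fraction(b)
    # Then remove a by dividing both sides by it.
    return right / Fraction(a)

x1 = solve_linear(15, 12, 18)   # 15X + 12 = 18
x2 = Fraction(42)               # reported answer to (X - 12)/5 = 6
y = Fraction(11, 5)             # reported answer to (5Y + 3)/2 + 1 = 8

# Substitute each answer back into its simplified equation.
check1 = 15 * x1 + 12 == 18
check2 = (x2 - 12) / 5 == 6
check3 = (5 * y + 3) / 2 + 1 == 8
print(x1, check1, check2, check3)
```

Note that Fraction reduces 6/15 to its lowest terms, 2/5; the two are the same value.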


A.9 Exponents and Roots

Rule A: The expression X² means raise X to the 2nd power by
        multiplying it by itself one time: X × X = X².

        Examples: 10² = 10 × 10 = 100; 5² = 5 × 5 = 25

        For X³, multiply X by itself two times: X × X × X = X³.
        Example: 2³ = 2 × 2 × 2 = 8

        Use the same basic logic for all exponent values, X⁴, etc.

Rule B: The square root of a number is a number that when
        multiplied by itself will yield the original value.

        Examples: What is √16? What we are really asking is
        what number will yield "16" when multiplied by
        itself. The answer is 4.

        √144 = ?   Ans.: 12, since 12 × 12 = 144

        Square roots can be found very easily using a
        calculator.
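
For readers who would rather use a computer than a calculator, here is a small Python sketch of Rules A and B. It assumes whole-number inputs and uses the standard library's integer square root; the variable names are ours.

```python
import math

square = 10 ** 2            # 10 multiplied by itself one time
cube = 2 ** 3               # 2 multiplied by itself two times
root_16 = math.isqrt(16)    # the number that yields 16 when multiplied by itself
root_144 = math.isqrt(144)  # the number that yields 144 when multiplied by itself

print(square, cube, root_16, root_144)
```

math.isqrt works on whole numbers only; for other values math.sqrt returns a decimal result.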



A.10 Summation

A very frequently used symbol is the capital Greek letter
sigma (Σ). It means "sum of" or "add up."

A. For the following values of X, what is ΣX?
   X = 2, 4, 6

   List X's in a column, and sum down to obtain answer:

        X
        2
        4
        6
   ΣX = 12

B. When X's consist of scores of 1, 4, and -3, what is ΣX²?

   List X's in a column, but note that problem asks for the
   sum of the "squared X's." Thus, an adjacent column listing
   the X²'s is needed to obtain the required result.

        X      X²
        1       1
        4      16
       -3       9
   ΣX = 2   ΣX² = 26       The answer is 26.

C. For the same data as in "B," what is (ΣX)²?

   The parentheses make the problem read: What is the
   square of the sum of all the X's? Do you see the
   difference between this type of problem and that
   represented in "B"?

   To derive the correct answer, you sum X's (ΣX), and then
   square the result. As shown in "B," ΣX = 2. Thus:

   (ΣX)² = (2)² = 4

D. When X = 1, 3, and 2, and Y = 7, 1, and 4, what is ΣXY?

   This one may seem a little tricky, but it means: what is
   the sum of the products of each X multiplied by its
   corresponding Y? In other words, add the "XY's" together.
   We start by making an X column, a Y column, and then an
   XY column. Summing down the latter gives the result.

        X      Y      XY    (note: this is formed by X × Y)
        1      7       7
        3      1       3
        2      4       8
   ΣX = 6   ΣY = 12   ΣXY = 18      The answer is 18.
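
The four kinds of summation problem above differ only in when the squaring or multiplying happens relative to the adding. A short Python sketch, using the same numbers as examples B, C, and D (the variable names are ours), makes the order of steps explicit:

```python
X = [1, 4, -3]                       # scores from examples B and C

sum_x = sum(X)                       # sum of X: add the scores
sum_x_sq = sum(x ** 2 for x in X)    # sum of squared X's: square each score, then add
sq_sum_x = sum(X) ** 2               # square of the sum: add first, then square

X2 = [1, 3, 2]                       # scores from example D
Y2 = [7, 1, 4]
# Sum of XY: multiply each X by its corresponding Y, then add the products.
sum_xy = sum(x * y for x, y in zip(X2, Y2))

print(sum_x, sum_x_sq, sq_sum_x, sum_xy)
```

The difference between sum_x_sq (26) and sq_sum_x (4) is exactly the difference between problems "B" and "C."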






Unit A Review Test

Check your answers in answer section. If you score below 80%
you need review, and you should check the appropriate sections
in this text.

addition of positive and negative numbers

1. 36X + 23X = ?

2. -17X (+) 9X = ?

3. -22X (+) -16X = ?

subtraction of positive and negative numbers

4. 13Z (-) -4Z = ?

5. -20X (-) 6X = ?

6. -8 (-) -13 = ?

multiplication of positive and negative numbers

7. -22 (×) -4 = ?

8. 7W (×) -9M = ?

division of positive and negative numbers

9. -6X (÷) 1.5X = ?

10. -5 (÷) -2 = ?

order of operations

11. 5 + (15 - 2) × 2 = ?

12. 4 ♦ 2^ + 5^ - r 

13. 8 X 2 + 6^ ♦ 3 - ? 

equations

14. Solve for X: X - 10 + 4 = 13

15. Solve for Y: 4/5 = Y/15

16. Solve for X: (X - 6 t 2) X 2 - 12 

squaring and square roots

17. VST - ? 

18. (14.2)² = ?

fractions and percentages

19. 1/3 (×) 3/9 = ?

20. Compute as a percentage: 13/15



summation

      X      Y
     11      3
     10      5
      8      2

21. ΣX = ?

22. ΣY = ?

23. ΣX² = ?

24. (ZI)^ - t 

25. ΣXY = ?



UNIT I

ORGANIZATION AND PRESENTATION OF DATA

The very word "statistics" suggests to many people (and
particularly those in statistics courses) a confusing mass
of numbers, difficult to work with and impossible to
interpret. The objectives of this unit are directed towards
teaching the student how to take a set of data and organize it
into manageable form. Usually this involves constructing
what is called a frequency distribution which is later
transformed into a more graphic visual form, a frequency polygon.
The present material deals with the construction and use of
these organizing devices in psychology and education.
• 

Specific Objectives

1.1 Identify the upper and lower limits of scores
1.2 Differentiate between grouped and ungrouped frequency
    distributions

    1.2.1 Construct ungrouped frequency distributions

    1.2.2 Construct grouped frequency distributions

1.3 Construct frequency polygons, histograms, cumulative
    frequency graphs, and define uses of each

1.4 Be able to identify the forms of distribution with
    respect to modality, skewness, and shape



1.1 Instructional Unit: Real Limits of Scores

Whenever we work with numbers, a problem that is frequently
encountered concerns whether the measure is continuous
or discrete. Discrete measures can only fall at certain
separated points on a scale. For example, if we were to measure
(determine) the number of students enrolled in different
sections of a statistics course, we might end up with numbers
such as 5, 21, 16, 32, 7, 18, etc. These are discrete measures
for they fall at separate integer points. You can have 5
students in your class, but will rarely find instances where
student enrollment is 5.5, or 5.25. Similarly, the number of
children in a family, the number of multiple-choice items
correctly answered, the number of automobiles in a person's
garage, or the number of mice in one's basement can only
take on integer values, and thus remain discrete.

Other scale values, however, are not restricted to certain
points. The only reason that they may appear to be is
related to the limited sensitivity of our measuring
instruments. For example, if you were asked to measure pencils,
given a ruler that shows only whole inches (not halves, fourths,
or sixteenths, etc.), the scores reported for the first three
pencils might be something like 9 in., 12 in., and 5 in. BUT
is the first pencil really exactly 9 in. on the nose (or on
the "tip" to be more exact)? If we gave you a ruler that was
sensitive to 1/10 of an inch, your answer might change to, say,
9.2 in. But is it really exactly that? We could give you a
third ruler that is sensitive to 1/100 of an inch, only to
find that you now report the pencil to be 9.23 in. Are you
getting the "point?" Whereas discrete measures take only
certain values, continuous measures, such as height, length, time,
age, ability, weight, etc., can take on an infinite number of
values. For practical reasons, as well as because of physical
limitations, we restrict these values according to the
sensitivity of the measurement instruments we employ. Age, for
example, is commonly measured in year units (but recognize
that no one is exactly 31 years old or 57 years old, etc. given
the infinite possible values of time); gasoline is commonly
measured to the nearest 1/10 of a gallon (but does your car
really take exactly 9.2 or 5.7 gallons to the molecule?); and
weight is commonly measured to the nearest whole pound (but
no one is exactly 130 lbs. or 146 lbs., etc.). Thus, whatever
degree of sensitivity our measurements permit, we end up with
a series of possible values. Between each pair of successive
possible values (let's say 9 in. and 10 in. where length is
measured to the nearest whole inch) is a "gap" that divides
one value from the other. It is assumed that the area below
the halfway point in the gap (which would be 9.5 in. in our
example) is associated with the lower possible value (i.e., 9
in.), whereas the area above the halfway point is associated with
the higher possible value (i.e., 10 in.). This may be getting
a little heavy at this point, so we'll stop the narrative for
awhile and consider some concrete examples:



1. Speaking of heavy and concrete, suppose that on a scale
   that records weight to the nearest whole pound only,
   "Bubbles the Whale" weighs in at 4,356 lbs. What is her
   real weight? Nobody knows; but since the scale recorded
   her at 4,356, we do know that it is closer to that possible
   (whole pound) value than to other possible whole pound
   values. More specifically, Bubbles' real weight passed
   the 4,355 point on the scale. Thus, we know that her
   real weight has to be somewhere above the halfway point in
   the gap between 4,355 and the measured weight 4,356. If
   it was below the halfway point, the 4,355, not the 4,356,
   would have been the value recorded. What is the halfway
   point? The answer is 4,355.5. If the real weight was
   below that, the needle on the scale would have stayed at
   4,355 rather than moving, as it obviously did, to 4,356.
   The other side of the coin is that Bubbles never made it
   to the next possible weight, 4,357 lbs. (although she
   might after lunch). What that means is that her real
   weight also has to be below 4,356.5, the halfway point
   between the measured weight of 4,356 and the next possible
   higher one, 4,357. If it had been above that point, the
   needle would have tipped to the 4,357 rather than staying
   where it was. Getting the picture? Very quickly try
   this one: Jack, Bubbles' trainer, is recorded on the same
   scale as weighing 131 lbs. What are the limits of his
   real weight?

   Ans. is 130.5 (lower) and 131.5 (upper). Why? The next
   possible lowest weight would have been 130 lbs. Given
   that the needle passed that and went to the measured 131
   lbs., we know that Jack has to be somewhere above 130.5.
   On the other hand, the next possible higher weight would
   have been 132 lbs. Given that the needle never made it
   to there, we know that Jack has to be somewhere below
   131.5.

2. If you understand the above example, you may want to skip
   this one and go to #3. Here it goes. Suppose that your
   boss gives you a box of pencils and a ruler, and asks you
   to measure each pencil to the nearest inch only (if you
   reported something like 10.1 in., you would be fired
   immediately). Well, you take out the first pencil and line
   up its bottom with the bottom of the ruler. The result
   is illustrated below:

   [Illustration: a pencil laid along a ruler marked 1 through 7 in.]

   What would you report? Assuming you wanted to keep your
   job, you would write down 6 in. for pencil #1. Is the
   pencil exactly 6 in.? Of course not, but why would you
   pick "6" instead of "5" or "7"? The obvious answer,
   "It's closer!!" In the same sense, try this one: How
   far can a bear go into the woods?

   Ans.: Halfway, because once he/she passes that point,
   he/she is leaving the woods!!

   How far can a pencil extend within the gap between 6 and
   7 in. and still be a 6 in. pencil (if measurements are
   restricted to whole inches)?

   Ans.: Halfway, that is, to 6.5 in., because after that
   point the pencil becomes a 7 in. pencil.

   Final question: What are the limits of the real length
   of the pencil used in the example?

   Ans.: 5.5 (lower) and 6.5 (upper). As far as "real"
   length is concerned, this boundary is all we can know
   for sure.

3. The "limits" we have been talking about similarly apply
   to scores more precise than integers. Suppose that we
   are racing two turtles in the 10 inch dash. Recognizing
   the possibility of a new world record, we bring two
   stopwatches, capable of recording time to 1/10 of a second,
   to determine the speed of each turtle. The race begins.
   Turtle A, known to be a sprinter, takes an early lead,
   but tires at the 8 inch pole. Turtle B rapidly (for
   turtles) closes the gap, and sensing victory, pulls away
   during the final stretch and breaks the tape first with
   his shell. We check the stopwatches for records. There
   are none, but both finishers are recorded at 9.8 seconds.
   Assuming no timing errors were made, how is that possible
   since Turtle B was clearly the winner?

   Ans.: The 9.8's do not represent the turtles' exact times,
   but rather, only the "best" estimates given the limitations
   of the timing instruments. What can we say for certain
   about the real times?

   Ans.: Only that for both "A" and "B" their "real" times
   were somewhere above 9.75 sec. (if not, the stopwatch
   would not have made it past 9.7 to the 9.8 actually
   recorded), and somewhere below 9.85 sec. (if not, the needle
   would have "slipped on" to show 9.9 rather than the
   recorded 9.8). Whatever the exact times were, we can assume
   that Turtle B's was slightly better than Turtle A's, but
   we probably would have needed a watch calibrated to 1/100
   to show it.

Thus, anytime a continuous measure is expressed in terms of
discrete points, the result is gaps between possible scores.
Halfway into the gap is a point which divides one possible
score from another. Given a possible score, the dividing
point below it is called its lower limit, while that above it
is called its upper limit. Together, these two dividing
points comprise the real limits for the given score.

(Think about the above for a minute or two, and then try the
following examples. Cover the answers as you go so that you
can test yourself on the questions presented.)

IQ scores are expressed as integer values such as 100, 95,
127, etc. What are the real limits of a score of 97?

Ans.: 96.5 (lower) and 97.5 (upper). Note that these separate
the 97 from the next possible lower and higher scores, 96 and
98, respectively.

What are the real limits of a score of 100?

Ans.: 99.5 (lower) and 100.5 (upper).

Gasoline is sold to the nearest 1/10 of a gallon. What are the
real limits of 10.6 gallons?

Ans.: 10.55 (lower) and 10.65 (upper). Note, for example, that
10.55 equally divides the recorded 10.6 from the next possible
lower score, 10.5.

What about 20.1 gallons?

Ans.: 20.05 (lower) and 20.15 (upper)

Meat is weighed to the nearest 1/100 of a pound. What are
the real limits of 1.29 lbs.?

Ans.: 1.285 (lower) and 1.295 (upper)

Suppose meat is weighed to the nearest 1/1000 of a pound? Give
the limits of 3.296 lbs.

Ans.: 3.2955 (lower) and 3.2965 (upper)



Having trouble? Note that for the last example the next
possible lowest weight would have been 3.295 (remember, the numbers
can change in this one by 1/1000). Isn't it true that a 3.2955
is halfway between 3.295 and the recorded 3.296? If you don't
see this, add a zero to the two possible scores, which is
permissible, as the zero will not change their values:

     3.2960   (recorded)
     3.2955   (midpoint in gap)
     3.2950   (next possible lower score)

Thus, the 3.2955 is the recorded score's lower limit. By the
same logic, 3.2965 would be determined as its upper limit.
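
The rule behind all of these answers is the same: go half a measurement unit below and half a unit above the recorded score. Here is a minimal Python sketch of that rule. The function name real_limits and its arguments are ours, and decimal strings are used (an assumption of this sketch) so that values such as 3.2955 come out exactly rather than with rounding noise.

```python
from decimal import Decimal

def real_limits(score, unit):
    """Return the (lower, upper) real limits of a recorded score.

    score and unit are given as strings, e.g. "10.6" measured to the
    nearest "0.1", so the arithmetic stays exact.
    """
    half_unit = Decimal(unit) / 2          # halfway into the gap
    recorded = Decimal(score)
    return recorded - half_unit, recorded + half_unit

print(real_limits("97", "1"))          # IQ score, whole units
print(real_limits("10.6", "0.1"))      # gasoline, tenths of a gallon
print(real_limits("3.296", "0.001"))   # meat, thousandths of a pound
```

Trying it on Jack's 131 lbs. (unit "1") should reproduce the limits worked out earlier.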






1.1 POSTTEST 



1. Suppose we are measuring the length of pencils to the nearest
   inch. What are the real limits of scores of 10 in. and 5 in.?

2. Reaction time is measured in 1/1000 of a second. What are
   the real limits of 1.372 sec? 1.300?

3. Speed in a race is measured to nearest tenth of a second.
   What are the real limits of scores of 10.1? 11.0?

See answer section for correct answers. If you missed any,
reread instructional unit (or check Yellow Pages for a
statistician on call).






1.2.1 Instructional Unit: Ungrouped Frequency Distributions

Suppose you are a teacher who has given a test to 20
students. You spend all night grading the papers and
when you are finished you end up with the following scores:

      1    8   10    4
      1    0    6    3
      3    7    4    7
      9    2    9    3
      6    5    2    8

You could leave the scores in a random arrangement as
illustrated above, but this would provide you with very
little idea of how individuals performed and how the
class did as a whole. A much better procedure would be to
construct what is called a frequency distribution. The
use of a frequency distribution puts the data in a form
that makes sense and answers most of the general questions
you might have regarding what occurred on the test. A
frequency distribution of the above data might look
something like this:



      X     f     cf
     10     1     20
      9     2     19
      8     2     17
      7     2     15
      6     2     13
      5     1     11
      4     2     10
      3     3      8
      2     2      5
      1     2      3
      0     1      1
          n = 20

It should be immediately obvious that the frequency
distribution puts the data into a more manageable form. The
"X" column lists all the possible scores; the "f" column
tells us the frequency (or number) of the individual
scores obtained; and the "cf" column indicates the cumulative
frequencies of the scores. For example, looking at the cf
for a score of 3, we know that 8 scores were obtained equal
to or less than 3.5 (remember from Unit 1.1 that the upper
limit of a score of 3 would be 3.5).

The steps involved in constructing a frequency distribution
are fairly simple:

1. Find the highest and lowest values. In the example the
   high value is 10 and the low value is 0.

2. Set up the X column going from the high value at the top
   (10) to the low value (0) at the bottom. Include all
   possible scores in the column; for example, even if no
   students earned a score of "3" on the test, you would still
   usually want to include the category "3" in the X column.

3. Starting at the top (i.e., at the score category, 10)
   tally how many times each score occurred. This value will
   be recorded in the f column.



4. Once all the f's are recorded, the next task is to fill in
   the cumulative frequency (cf) column. The cf for a
   particular score indicates the number of frequencies below that
   score. You construct the cf column by starting at the
   bottom of the distribution and building one row (i.e.,
   score) at a time. The first score in our example (shown
   above) is "0". The f for "0" is 1, meaning that
   one person obtained a 0 on the test. How many scores have
   accumulated at the "0" category? Only 1 (the same person);
   therefore, cf is also 1. Moving to the next score, "1",
   we find an f of 2 (two people scored a "1" on the test).
   How many scores have accumulated there?

   Ans.: 3 (the two people who scored "1" and the one person
   who scored "0").

   So, you proceed by row, adding the frequency for the row
   score to the cf for the previous row. The result is a cf
   for that row. Check this out by examining the cf listing
   for the score, "5". Note that "5" has an f of 1. Its cf
   of 11 was obtained by adding its frequency (i.e., 1) to
   the cf for the score below it (cf for "4" = 10). Thus, we
   can say that "11" (10 + 1) frequencies have accumulated
   through a score of "5".

   Now it is time to throw a monkey wrench of sorts into the
   conceptual machinery. If you are observant, you may have
   noted that our description of how to derive cf did not
   exactly conform to the definition given in the second
   sentence above; i.e., cf = the number of frequencies below a
   certain score. Turn once again to the example and examine,
   for instance, the tabulations for the score, "4". It is
   true that 8 people clearly scored below "4" (check the cf for
   "3") and 2 people scored at "4"; can we really say that 10
   people scored below "4"? No! That is because cf is really
   supposed to be associated with the upper limit of the scores.
   Thus, checking the example (this is the last time, promise!),
   we can say that 1 person scored below ".5" (the upper limit
   for "0"); 3 people scored below "1.5" (the upper limit for "1");
   5 people scored below 2.5; and, skipping a few, 17 people
   scored below "8.5".

   Question: What score point do all 20 students fall below?

   Ans.: If you said "10" you were close. A better answer would
   be "10.5" because that's the highest possible limit for total
   score distribution. None of the 20 people could have been
   above that without necessitating the addition of a higher
   score (X) listing on the distribution.

   Before leaving this section, make sure you understand the
   following:

   (a) f's are associated with the actual score listings (X's)
       on the distribution.

   (b) cf's are associated with the upper limits of those listings.

   Close your eyes and say that three times, and your book and
   statistics course will forever disappear.

   (Book still around? You were supposed to say "that" three times.)

5. Add up the f column to find n. The symbol "n" is used in
   statistics to indicate the total number of scores.
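
Steps 1 through 5 can be sketched in a few lines of Python. This is only an illustration, not a required method: the function name frequency_distribution is ours, and the score list assumes the twenty test scores are 1, 8, 10, 4, 1, 0, 6, 3, 3, 7, 4, 7, 9, 2, 9, 3, 6, 5, 2, 8, a reading that agrees with the f column and with n = 20 in the example table.

```python
from collections import Counter

# Assumed reading of the twenty test scores (matches the example table).
scores = [1, 8, 10, 4, 1, 0, 6, 3, 3, 7, 4, 7, 9, 2, 9, 3, 6, 5, 2, 8]

def frequency_distribution(scores):
    """Return rows of (X, f, cf, upper limit), highest score first."""
    counts = Counter(scores)                  # step 3: tally each score
    low, high = min(scores), max(scores)      # step 1: lowest and highest values
    rows = []
    cf = 0
    for x in range(low, high + 1):            # step 4: build cf from the bottom up
        f = counts.get(x, 0)                  # step 2: keep scores nobody earned
        cf += f
        rows.append((x, f, cf, x + 0.5))      # cf belongs to the upper limit of X
    rows.reverse()                            # high value at the top
    return rows

table = frequency_distribution(scores)
n = sum(f for _, f, _, _ in table)            # step 5: add up the f column
for x, f, cf, upper in table:
    print(f"{x:>3} {f:>3} {cf:>3}   (cf scores fall below {upper})")
print("n =", n)
```

The last element of each row is a reminder of point (b) above: the cf of 10 for a score of "4" counts the people who fall below 4.5, not below 4.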



1.2.1 POSTTEST (answers in back)

Construct a frequency distribution for the following data.
Be sure to include a cf column in the table. Also determine n.



     .10    .30    .30    .20    .00    .10    .30    .10
     .80    .60   1.00    .70    .90    .90    .00    [illegible]
    1.00    .90    .10    .40    .60    .90    .70    .40
     .50    .90    .60    .50    .20    .60    .50   1.00
     .20



Final note: The above descriptions pertain to
ungrouped frequency distributions because no tampering
has been done to the size of the original units. Also,
every number still maintains its own identity after
the frequency distribution has been made.

1.2.2 Instructional Unit: Grouped Frequency Distributions

Many times it is impractical to attempt to list
every possible score in a frequency distribution. This
may especially be the case when scores take on many
different values. Thus, in order to simplify the
recording and interpretation of results, grouped
frequency distributions are often employed.

There are no set rules for the construction of
grouped frequency distributions, but the following
guidelines are generally followed whenever possible:

(1) The use of 8-15 intervals is fairly common. The
    larger the sample of cases, the larger number of
    intervals you may wish to employ.

(2) Intervals of the same width (i) are usually
    preferred.

(3) The interval limits should be chosen so that the
    interval midpoint tends to correspond with the
    observed values.

More specific rules are as follows:

(1) Order the scores from highest to lowest.

(2) Find the approximate range of scores by subtracting
    the lowest score from the highest score.

(3) Divide the range into equal units so that you end up
    with 8-15 intervals. For example, if your highest
    score is 100 and your lowest score is 30, the range
    would be 70. You therefore might decide upon 8
    intervals of 9 units, 10 intervals of 7 units, 11
    or 12 intervals of 6 units, etc.

(4) Make sure your lowest score will be placed within
    the lowest interval. For example, if our lowest
    score was 30 and we decided to use intervals of 9
    units, we might start with the interval, 26-34,
    followed by 35-43, etc. You may note that when the
    interval width is odd in number (such as 5, 7, 9,
    etc.) the midpoint of the interval will be a whole
    number (midpoint of 26-34 would be 30, midpoint of
    35-43 would be 39). Thus, odd-numbered interval
    widths are usually desirable.

(5) Complete the recording of intervals from low to
    high, and tabulate the frequency of scores that
    fall into each.

(6) Then record cumulative frequencies (cf) as you
    did for ungrouped frequency distributions.



Example problem: Construct a grouped frequency
                 distribution for the following data:

    45   52   75   47   64   90   42   57   73   74
    59   81   49   58   87   90   69   72   46   63
    70   86   79   56   80   75   65   75   87   49
    50   67   90   85   84   83   68   69   76   56

High score is 90, low score is 42; range is therefore
approximately 50. Using intervals 5 units in length
would yield 10 separate intervals (which would be
fairly desirable). 5 is also an odd number, and
therefore the midpoints of the intervals will be
whole numbers (again, a desirable feature).

Select lowest interval making sure that it will
include the lowest score. An interval of 40-44 seems
sufficient for a start.

Build up from your lowest interval until you reach
the point where you can accommodate the highest
score (in this case 90). Do not worry if you end up
with one interval more (or less) than you originally
planned.

Tabulate frequencies and cumulative frequencies as
follows:



      X        f     cf
    90-94      3     40
    85-89      4     37
    80-84      4     33
    75-79      5     29
    70-74      4     24
    65-69      5     20
    60-64      2     15
    55-59      5     13
    50-54      2      8
    45-49      5      6
    40-44      1      1
            n = 40
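
The tabulation above can be reproduced with a short Python sketch. It is offered only as a check on the hand tally: the function name grouped_distribution and its parameters start and width are ours, and it assumes whole-number scores and equal-width intervals, as in the example.

```python
data = [45, 52, 75, 47, 64, 90, 42, 57, 73, 74,
        59, 81, 49, 58, 87, 90, 69, 72, 46, 63,
        70, 86, 79, 56, 80, 75, 65, 75, 87, 49,
        50, 67, 90, 85, 84, 83, 68, 69, 76, 56]

def grouped_distribution(scores, start, width):
    """Return rows of (low, high, f, cf), highest interval first."""
    highest = max(scores)
    rows = []
    cf = 0
    low = start
    while low <= highest:                  # build up until the top score fits
        high = low + width - 1             # e.g. 40-44 covers 5 whole scores
        f = sum(1 for s in scores if low <= s <= high)
        cf += f                            # accumulate from the bottom interval
        rows.append((low, high, f, cf))
        low += width
    rows.reverse()                         # highest interval at the top
    return rows

table = grouped_distribution(data, start=40, width=5)
for low, high, f, cf in table:
    print(f"{low}-{high}  {f:>2}  {cf:>2}")
print("n =", table[0][3])
```

Changing start or width shows how the same 40 scores condense differently; the cf in the top row always equals n.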





Final note: The purpose of using grouped frequency
distributions is to condense data. Keep in mind that the
example frequency distribution, shown above, is capable of
accommodating all possible scores. Take, for instance,
the lowest interval in the distribution, 40-44. What are
the real limits of that interval? The answer is 39.5-44.5
(see Unit 1.1 if you are uncertain about this). What are
the real limits of the next highest interval, 45-49?
Answer: 44.5-49.5. So, if by some chance, we obtained a
score of 44.32, in which interval would it be represented?
Answer: We would place it in the interval 40-44, since it
falls within the real limits of only that interval.
Similarly, any score you could possibly imagine between
39.5-94.5 (the real limits of the whole distribution) falls
within one and only one interval in the distribution.

What might give you a little more difficulty is the idea
of cumulative frequency. Notice on the distribution that
we have a cf of 1 for the interval 40-44, a cf of 6 for
the interval 45-49, etc. Understanding how we obtained
those values should not present much of a problem, but try
this: To what specific score does a cf of 1 refer? A
reasonable guess would be "42," the midpoint of the
corresponding interval, 40-44. A reasonable guess in this
case is not a correct one: We really cannot assume that
one score "accumulated" that was less than or equal to
42. The score could have been a 43, a 43.57, a 44.05,
etc. But there is one thing we positively can assume in
talking about our cf of 1, and that is that the score was
less than 44.5, the upper limit of the interval. If it
was not less than 44.5, it could not have been placed
within that interval. Think about it. (And remember that
44.5 is also the lower limit of the interval 45-49.)

Now try this: To what hypothetical score would we
associate a cf of 13? Looking at the distribution, we
immediately realize that the 13th score fell within the
interval, 55-59. Is the hypothetical score therefore 57,
the interval midpoint? No, we cannot be certain that the
13th score was less than or equal to 57 (it might have
been 58, 59.2, etc.). But, we can be absolutely certain
that the 13th score is less than 59.5 (the upper limit of
the appropriate interval); our answer therefore should be
59.5. One more for practice: We can be absolutely certain
that our 24th score falls below what hypothetical cutoff
point? Answer: 74.5, the upper limit of the interval
70-74.
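
The idea that every imaginable score lands in one and only one interval can be sketched in Python as follows. The function names are ours, and one detail is an assumption of this sketch rather than something the text settles: a score falling exactly on a shared limit (such as 44.5) is assigned here to the higher interval.

```python
def interval_real_limits(low, high, unit=1):
    """Real limits of a score interval such as 40-44 (whole-number units)."""
    half = unit / 2
    return low - half, high + half

def find_interval(score, intervals):
    """Return the (low, high) interval whose real limits contain the score."""
    for low, high in intervals:
        lower, upper = interval_real_limits(low, high)
        # Assumption: the lower limit is included, the upper limit is not.
        if lower <= score < upper:
            return (low, high)
    return None                            # outside the whole distribution

intervals = [(low, low + 4) for low in range(40, 95, 5)]   # 40-44 ... 90-94

print(interval_real_limits(40, 44))        # limits of the lowest interval
print(find_interval(44.32, intervals))     # the 44.32 example
print(interval_real_limits(70, 74)[1])     # the cutoff tied to a cf of 24
```

The upper real limit is the value to pair with each cf, which is why the last line prints the 74.5 cutoff for the 24th score.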



1.2.2 POSTTEST (answers in back)

Construct a grouped frequency distribution for the
following data:



01 
31 
16 
3S 
14 
10 
12 
15 
36 
OS 
09 



07 

fo 

01 
36 
30 
08 
01 
14 
69 
01 
20 



40 
18 
07 
OS 
07 
36 
36 
30 
10 
07 
18 



5 



1.3.1 Instructional Unit: Frequency Polygons

A frequency polygon is used to provide a pictorial
or graphic representation of your frequency distribution.
It is extremely valuable in that it gives you or your
reader a quick, clear picture of the distribution of
scores. The procedures involved in constructing a
frequency polygon are fairly simple:

(1) Draw a horizontal axis, and label it "X." It will
    be used to represent all possible scores. Divide
    the X axis into equal units; each unit will
    correspond with one of the points on your score scale.
    (A good length for the X axis is 3" to 5".)

(2) Draw a vertical axis, and label it "f." It will
    be used to show the frequency of your scores. In
    dividing the f axis into equal units start at zero
    and build up to the highest f value in your
    frequency distribution. You will want the f axis to
    be approximately the same length as the X axis.
    Therefore, it is not necessary to include every
    possible f value on your graph; you may wish to
    build up by 2's, 5's, 10's, etc., whatever seems
    reasonable considering the amount of scores with
    which you are dealing.

(3) Once the axes are constructed and labeled, place a
    dot directly above each X value equal to its
    corresponding f value.

(4) Finally, connect the points together and bring both
    end points down to where the next X value would be
    on the X axis.

Example problem: Construct a frequency polygon for
                 the following data:

      X     f     cf
     10     5     32
      9     4     27
      8     6     23
      7     8     17
      6     4      9
      5     0      5
      4     1      5
      3     2      4
      2     1      2
      1     1      1

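The resulting frequency polygon is:

[Figure: frequency polygon of the data above, with f on the vertical axis and X on the horizontal axis]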
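
Since a polygon is just a list of connected dots, steps (3) and (4) can be sketched in Python as a list of (X, f) points with one extra zero-frequency point added at each end. The function name polygon_points and the step parameter are ours; the sketch only computes the points and leaves the drawing to whatever plotting tool (or graph paper) you prefer.

```python
xs = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]      # X column of the example
fs = [5, 4, 6, 8, 4, 0, 1, 2, 1, 1]       # f column of the example

def polygon_points(xs, fs, step=1):
    """Return the (X, f) points of a frequency polygon, left to right."""
    dots = sorted(zip(xs, fs))                      # step (3): one dot per X value
    left_anchor = (dots[0][0] - step, 0)            # step (4): bring both ends down
    right_anchor = (dots[-1][0] + step, 0)          # to where the next X would be
    return [left_anchor] + dots + [right_anchor]

points = polygon_points(xs, fs)
print(points)
```

For grouped data the same function can be fed the interval midpoints as xs, with step set to the interval width.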

Final note: The procedures for constructing a frequency
polygon of grouped data are essentially the same but
there is one major difference. Examine the frequency
distribution shown in Unit 1.2.2. Looking at the various
intervals, which scores do you plot? Unlike what was
discussed to be the case for cf (where it was decided at
the conclusion of 1.2.2 that the upper limit of the
interval comprised the desired score), this time we are
looking for the score that is most representative of the
whole interval. If you are thinking "midpoint" this
time, you are entirely correct. So, the only real
difference in plotting grouped data is that the various
interval midpoints will be represented on the X axis.

Example problem: Construct a frequency polygon for
                 the following data:

      X        f     cf
    90-99     10     44
    80-89      6     34
    70-79      5     28
    60-69     11     23
    50-59      1     12
    40-49      0     11
    30-39      5     11
    20-29      3      6
    10-19      2      3
     0-9       1      1

[Figure: frequency polygon of the grouped data, plotted at the interval midpoints on the X axis]

Final note: Notice that when we are dealing with
grouped data, the interval midpoints are used on the
X axis. Also note that the graph is anchored to the horizontal
axis at points preceding and beyond the score labels.

1.3.1 POSTTEST (answers in back)

Construct frequency polygons for the following
distributions:



a)    X     f     cf          b)    X        f     cf
     25     1     19              24-26      4     45
     24     2     18              21-23     12     41
     23     3     16              18-20      9     29
     22     6     13              15-17     10     20
     21     3      7              12-14      1     10
     20     2      4               9-11      3      9
     19     1      2               6-8       2      6
     18     1      1               3-5       4      4



1.3.2 Instructional Unit: Histograms

An alternative way of graphing frequency data is via
a histogram. Histograms are virtually the same as the
frequency polygons we have just covered, except that
vertical bars, extending in width from the lower limit to
the upper limit of the score or interval, are used instead
of connecting lines. The height of the bars, comparable
to the placement of the dot in the frequency polygon,
corresponds to the frequencies of the scores or intervals
as labeled on the vertical axis. Make your bars even in
width; be sure to use the midpoint of the score or
interval, positioned in the middle of the bar, as your
horizontal axis label.



Consider the following frequency distribution:

      X     f     cf
     48     5    120
     47    15    115
     46    25    100
     45    30     75
     44    20     45
     43    25     25



The resulting histogram would look like:

[Figure: histogram of the data above, with f (marked 10 through 30) on the vertical axis and X (labeled 43 through 48) on the horizontal axis]

Note that the lowest vertical line is the lower limit
for the score of 43 (= 42.5) and the uppermost vertical
line is the upper limit for the score of 48 (= 48.5).






The procedures are the same for constructing histograms of grouped data. Just make sure that the midpoints of the "bars" correspond to the midpoints of the intervals.

1.3.2 POSTTEST (answers in back)

Construct histograms using the data provided in POSTTEST 1.3.1.

1.3.3 Instructional Unit: Cumulative Frequency Graphs

Cumulative frequency graphs show how many observations fall below a given score or class interval. There is nothing "tricky" about plotting cumulative frequency. The vertical axis of the graph is labeled "cf" with the various points used to represent the cumulative number of observations. The horizontal axis may still be labeled "X" (as in the case of frequency polygons and histograms) but this time the points on the X axis will be expressed in terms of the upper limits of the scores. For review, think about this and remember it: In plotting the cumulative frequency of a class (or score), all observations are associated with the upper limit of the class (or score), not the midpoint.

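Consider the following frequency distribution:

    X         f     cf
    35-40    10     49
    29-34     6     39
    23-28     5     33
    17-22     9     28
    11-16    11     19
    5-10      8      8

The resultant cumulative frequency graph would look like:

[Figure: cumulative frequency graph of the distribution above, with cf on the vertical axis and the upper limits of the score intervals on the X axis]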




Final comments: Notice that the upper limits of the various score intervals are used on the X axis. Also notice that the higher end of the graph is not brought down to zero. Finally, notice that the steepness of the slopes corresponds to the amount of increase from interval to interval: when a large number of observations fall into a particular score interval, the slope becomes steeper.
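The points of a cumulative frequency graph can also be worked out mechanically. The following Python sketch is only an illustration (the function name `cf_points` is invented here, and it assumes whole-number scores, so that each upper real limit is the top score of the interval plus 0.5). It uses the distribution shown above.

```python
from itertools import accumulate


def cf_points(intervals, freqs):
    """Return (upper real limit, cf) points for a cumulative frequency graph.

    intervals: list of (low, high) score limits, lowest interval first.
    freqs: frequency of each interval, in the same order.
    """
    # Observations are associated with the upper limit, not the midpoint.
    uppers = [high + 0.5 for _, high in intervals]
    # cf of a class = its own frequency plus that of all lower classes.
    cfs = list(accumulate(freqs))
    return list(zip(uppers, cfs))


intervals = [(5, 10), (11, 16), (17, 22), (23, 28), (29, 34), (35, 40)]
freqs = [8, 11, 9, 5, 6, 10]
points = cf_points(intervals, freqs)
```

The largest jumps in cf (here, the 11 observations in 11-16) are where the plotted line is steepest.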

1.3.3 POSTTEST (answers in back)

Construct cumulative frequency graphs for the data presented in POSTTEST 1.3.1.






1.4 Instructional Unit: Forms of Frequency Distributions

Check back to Unit 1.3.1 and examine the two example frequency polygons. If you are observant you will notice that they are not identical in form. Rarely, in fact, will two frequency polygons ever be identical, since the form of each is determined by the number of observations recorded for the specific score points or intervals. Being able to characterize the form of a frequency polygon (and in essence the form of the frequency distribution) is very advantageous in reporting data, especially if space does not permit including the actual polygon in your paper. Generally, there are three types of characteristics that you should become familiar with. These are as follows:

(1) Shape - frequency polygons can normally be described in terms of the "general" shape in which they appear. Some common examples are:

[Figures: example polygons labeled Bell-shaped, Rectangular, U-shaped, Triangular, and J-shaped]

(2) Symmetry - another distinction can be made relating to whether your frequency polygon is symmetrical. A distribution is symmetrical if one half of it can be folded over so that it is exactly superimposed over the other half. Distributions that are not distinctly symmetrical are characterized as being skewed. Skewed distributions are labeled by the direction in which the longer tail is pointing. When the tail points to the right, we say that the distribution is positively skewed. This means that there is a preponderance of "lower" scores and relatively few scores in "higher" intervals. When the tail points to the left, we say that the distribution is negatively skewed. This means a preponderance of "higher" scores and relatively few "lower" scores. Think about these descriptions in examining the examples shown below:






[Figures: four example polygons labeled "bell-shaped, symmetrical"; "triangular, symmetrical"; "nonsymmetrical, positively skewed"; and "nonsymmetrical, negatively skewed"]



(3) Modality - A final distinction between frequency polygons can be made on the basis of the number of modes (peaks or high points). A unimodal distribution has only one relatively high point, whereas bimodal distributions have two high points or peaks. Similarly, a distribution with three peaks would be trimodal. What does it mean to your reader when you describe your distribution as unimodal, for example? If he is bright and competent in his knowledge of statistics, he immediately knows that, in your distribution, there was a fairly high concentration of scores within one particular interval. If you tell him your distribution is bimodal, he knows that two score intervals (usually separated to some extent) were relatively popular, etc.



Consider the following:

[Figures: four example polygons labeled "bell-shaped, symmetrical, unimodal"; "rectangular, symmetrical, amodal (no modes)"; "positively skewed, bimodal"; and "negatively skewed, unimodal"]



Final note: It is not always necessary to actually draw a frequency polygon in order to characterize the form of the distribution. For example, consider the following very simplified frequency distribution:

    X        f     cf
    9-10     1      9
    7-8      2      8
    5-6      3      6
    3-4      2      3
    1-2      1      1

Just by looking at the distribution we should feel confident in characterizing its form as follows: a) bell-shaped, b) symmetrical (as many high scores as low scores), and c) unimodal (really only one mode: interval 5-6).

If you are uncertain, quickly plot the above data, and read the above characterization again.
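The same "just by looking" check can be imitated crudely in Python. The sketch below is an assumption-laden illustration, not a standard procedure: the function name `describe_form` is invented, "symmetrical" is taken to mean that the f column reads the same from top to bottom as from bottom to top, and a "mode" is counted only where a frequency is strictly higher than both of its neighbors.

```python
def describe_form(freqs):
    """Crudely describe an f column (lowest interval first).

    Returns (symmetrical, modality). Illustrative only: real data are
    rarely exactly symmetrical, and flat-topped peaks are not counted.
    """
    freqs = list(freqs)
    # Exact mirror-image test for symmetry.
    symmetrical = freqs == freqs[::-1]
    # Pad with zeros so that the end intervals can also be peaks.
    padded = [0] + freqs + [0]
    peaks = sum(
        1
        for i in range(1, len(padded) - 1)
        if padded[i] > padded[i - 1] and padded[i] > padded[i + 1]
    )
    names = {0: "amodal", 1: "unimodal", 2: "bimodal", 3: "trimodal"}
    return symmetrical, names.get(peaks, f"{peaks} modes")


# The simplified distribution above, intervals 1-2 through 9-10.
form = describe_form([1, 2, 3, 2, 1])
```

For the simplified distribution this agrees with the characterization in the text: symmetrical and unimodal. Shape labels such as "bell-shaped" still call for a look at the plot.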



POSTTEST (answers in back)

Characterize the form of the following distributions in terms of general shape, symmetry, and modality (Note: bimodal distributions typically are U-shaped; no need to say both).



(answers in back)

The administrator of an adolescent center lists the ages of her 20 clients as follows:

    17  17  18  16  17  18  19  17  15  16
    17  16  17  16  19  18  17  18  15  17

Work with these data by doing the following:

1. Construct a frequency distribution of scores, listing f, cf, and n. Why would a grouped distribution be inappropriate?

2. What are the real limits of a score of 18?

3. What score point would be associated with the highest cf?

4. If you were to construct a cf graph, what types of score points would be used to define the horizontal axis?

5. What would be the shape of a frequency polygon of this distribution?



UNIT II

PERCENTILE RANKS AND CENTRAL TENDENCY

General Objectives

The preceding unit dealt with methods that are commonly used to facilitate the organization and interpretation of group data. In addition to "summarizing" data by use of frequency distributions, polygons, etc., it is usually desirable for the statistician to provide the reader with more specific information regarding the meaning of individual scores; that is, how individuals performed in comparison to others. One of the general objectives of the present unit involves familiarizing the student with percentile ranks, a descriptive index frequently employed to describe relative standing (individual scores) on aptitude and achievement tests. Another general objective is to facilitate the interpretation of group performance by presenting three numerical indices of "average" or "typical" achievement - the mean, the median, and the mode.

Specific Objectives

2.1 Define percentile rank

2.2 Compute percentile ranks for specified raw scores

2.3 Convert specified percentile ranks into raw scores

2.4 Define central tendency

2.4.1 Define and compute means

2.4.2 Define and compute medians

2.4.3 Define and compute modes

2.5 Select the appropriate relative positions of the mean, median, and mode in skewed and non-skewed distributions

2.1 Instructional Unit: Percentile Ranks

A percentile rank is a number that represents the percentage of scores that fall at or below some selected raw score. A percentile rank of 80, for example, means that a certain score has 80% of the other scores equal to or below it. In other words, if the measure involved was a mathematics achievement test, a percentile rank of 80 would mean that 80% of those who took the test scored at or below the selected score. A percentile rank of 50 would be a score where 50% of the scores fall below that point. We can symbolize the reporting of percentile ranks by P_X, where X represents the particular percentile in question; for example, the percentile rank of 80 would be symbolized as P80, the percentile rank of 36 would be symbolized as P36. Thus, when we say that John scored at P67, this can be taken to mean that John scored the same as or higher than 67% of the students in his class (or people taking the same test).

Do not confuse percentile ranks with actual scores. John may earn a score of 80 correct on his basic algebra test. Depending upon how other people in the class performed, John's score could be reported as P20, P90, etc. That is, the percentile rank for a score of 80 will be determined according to what this particular score means in relation to the class performance as a whole. If a score of 80 was higher than 50% of the remaining scores, then the percentile rank would be P50; if it was higher than 95% of the remaining scores, then the percentile rank would be P95, and so on.






2.1 POSTTEST (answers in back)
a) Define percentile rank: 



b) What does it mean when we say that a student performed 
at P89? 



c) John took a math test and scored a 96 which was reported 
to be Pj^« Later that day, he took an English test and 
earned a score of 42. The English score was found to be 
P95. On which test did John do better relative to the 
other students in his class? Why? 






2.2 Instructional Unit: Computing Percentile Ranks

The basic formula for computing percentile ranks is cf_X / N, where cf_X is equal to the cumulative frequency of the score in question and N is equal to the total number of observations or scores. In Unit I we defined cumulative frequency (cf), and showed how it is presented in tables and graphs of frequency distributions. You will remember (hopefully) that cf shows how many observations fall below a given score or class interval. The cf of a class (or score) is found by adding the class (score) frequency and the frequency of all lower classes (scores). As we move to higher and higher scores on our frequency distribution, cf will likewise become higher and higher.

Now look again at the formula for computing percentile ranks: cf_X / N. All that it means is that if you are interested in finding the percentile rank for a particular score, you must find the cumulative frequency for that score (that is, cf_X) and divide it by the total number in your population (that is, N). By the way, if we are using sample data, the total number of scores is symbolized as n; if we are using population data, the total number of scores is symbolized as N. In either case, the procedure for computing percentile ranks is the same.

Let's try a very simple example. Fred earns a score of 56 on his basketweaving competency examination. Fred is concerned about the meaning of his performance, and also about his grade. We cannot make too many inferences about his grade (the teacher may be very nasty or very generous), but by computing the percentile rank for Fred's score of 56, we can, at least, tell him the percentage of people in the class that scored lower than he did. We know that there were 100 people taking the basketweaving competency exam; therefore N = 100. When we order the scores from lowest to highest (as in a frequency distribution), we find that the rank of Fred's 56 is 38; therefore cf_X is approximately 38. Using the very simple computational formula, we find that Fred scored higher than 38% of the students in his class. His performance was at P38. Simple!

In a class of 30 students, Mike earns a score of 87 on a geography final. Mike's score is higher than that received by 28 other students. Using the formula, we substitute 28/30 for cf_X / N, and find that Mike scored higher than 93% of those taking the test. His P.R. is therefore P93 (approximately).

Hopefully by now you see no difficulty with using the formula, cf_X / N. And even more hopefully, you fully understand the concept behind the formula: find out how many people you beat on the test or measure, divide that by the number of people who took the test, and you get a proportion. Multiply that proportion by 100 and you get a percentile rank. So technically the formula for percentile ranks is

cf_X / N x 100.
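As a quick check on the arithmetic, here is the basic formula written as a tiny Python function. The name `approx_percentile_rank` is invented for this illustration, and, as in the Fred and Mike examples, the cf values fed into it are only approximations.

```python
def approx_percentile_rank(cf, n):
    """Basic formula: cumulative frequency divided by N, times 100."""
    return cf / n * 100


# Fred: cf is approximately 38 out of N = 100.
fred = approx_percentile_rank(38, 100)

# Mike: he "beat" 28 of the N = 30 students.
mike = approx_percentile_rank(28, 30)

print(f"Fred: P{round(fred)}   Mike: P{round(mike)}")
```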

Now that you probably feel somewhat confident about your ability to compute percentile ranks, we will throw a "monkey wrench" of sorts into the conceptual machinery. The monkey wrench relates to the computation of cf_X, for if our percentile rank is to be accurate, our estimation of cumulative frequency for the score in question must also be accurate. In the preliminary examples involving the basketweaving and geography scores of Fred and Mike, we have been using approximations of cf_X, and thus, the resultant percentile ranks of P38 and P93 are also approximations. Examine the following frequency distribution.



    X      f     cf
    10     2     22
    9      3     20
    8      6     17
    7      2     11
    6      5      9
    5      3      4
    4      1      1



Suppose you are shown the above distribution by a worried student in the class. He tells you that he received a score of 7, and is uncertain about how he performed relative to others. Since you successfully completed Unit I in basic statistics, he regards you as an expert in interpreting test data. You feel extremely confident; not only have you completed Unit I, you also learned something about percentile ranks in Unit 2.1. You know that percentile ranks do provide valuable information about relative performance. Furthermore, you know the formula for computing percentile ranks: cf_X / N. You smile, because giving this student information about his relative performance is obviously a very simple matter.

You are ready to begin the simple computational process. N obviously equals 22, the number of observations (people taking the test); cf_X for a score of 7 appears to equal 11 (that is, 11 people scored within or below the score category, 7). You divide 11 by 22 and get .50, the proportion of people who scored below your friend on the test. You multiply by 100 and report this as P50; you tell your friend that he was exactly "average" in his performance. Half the class was lower and half was higher than he was. You feel very accomplished and brilliant, until another friend who has completed Unit II in basic statistics tells you that you were wrong in your calculations. Why?



The mistake involves selecting 11 as your cf_X for a score of 7. You will remember, perhaps, that 11 refers to the cf for the upper limit of the score value of 7. Thus, we can say with certainty that 11 people scored below 7.5 (the upper limit); 22 people scored below 10.5; 4 scored below 5.5, etc.



Before proceeding with the explanation, you should be aware that you are likely to become very confused unless you have mastered the following concepts from Unit I:

(a) real limits: looking at the above frequency distribution, do you understand that a score (X) listing of, say, 8, really refers to an interval encompassing possible scores ranging from 7.5 to 8.5? If you don't, it's time to turn back.

(b) frequency distribution: looking again at the above frequency distribution, do you understand that the "X" column lists the scores that people could receive, the "f" column tells you how many of those scores were actually obtained, and the "cf" column tells you how many scores fall below the upper limits of the specified X's?

(c) cumulative frequency: as just asked, are you comfortable with what cf represents?

If your answer to any of the above is "no," the only way you will survive through this objective is by memorizing the formulas, knowledge that may last for a couple of days and then disappear from your memory forever. A better procedure would be to turn back to the appropriate instructional sections, review the concepts that got a "no" response, and then tackle the following through meaningful, rather than rote, learning.

Getting back to the problem at hand: If your friend scored a 7 on the test (see the frequency distribution), why would it be incorrect to assume that he scored higher than 11 people, and therefore stands at P50 (11/22 x 100)? The answer is that a cf of 11 is associated with the upper limit of the score value, 7. Thus, a cf of 11 is associated with a score of 7.5, not 7.0. By the same reasoning, a cf of 9 is associated with a score of 6.5 (the lower limit of 7). DO NOT PROCEED UNTIL YOU CAN SEE FROM THE FREQUENCY DISTRIBUTION WHY THE LATTER IS TRUE. Thus, all that we know about your friend's standing is that he "beat" somewhere between 9 and 11 people, as his score of 7 is somewhere between 6.5 and 7.5. Represented as an illustration of sorts:

    X (scores)                    cf (number "beaten")

    7.5                           is associated with 11 people
    7 (the score of interest)     is associated with ?
    6.5                           is associated with 9 people




The problem is that although we know that a score of 7 was higher than at least 9 other scores, we cannot say exactly where it stands in terms of a percentile rank. You will note on the frequency distribution that 2 scores fell in the interval for "7" (i.e., 6.5 - 7.5), but were they higher or lower than 7? They could have been (theoretically) 6.96 and 7.49; 6.51234 and 6.7; 7.001 and 7.36; etc. You will note that all of these possibilities would have simply been recorded as "7". So, since we can never know for certain, we must estimate, and please try to follow this conceptually: we assume that the scores associated with each interval are evenly distributed within those intervals; half fall above the midpoint and half fall below the midpoint, and so on. If 2 scores are recorded for the score point 7, we assume that one was greater than 7 and one was less than 7 (remember 7 is the midpoint of the interval, 6.5 - 7.5). Notice on the table that 6 scores are associated with the score point 8. All we can really assume about these scores is that they are evenly distributed in the interval 7.5 to 8.5: three fall below the midpoint, 8, and three fall above the midpoint, 8. Getting back to the original problem of determining the cf for a score of 7: How many scores fell on or below the midpoint, 7? Since we are uncertain about two scores, we must use a process called linear interpolation which will estimate where those questionable scores would fall considering: 1) the size of the interval (in this case, 1 unit long - from 6.5 to 7.5); 2) the number of questionable scores (in this case, two). In using linear interpolation, we assume that all scores are evenly distributed along the interval with which they are associated. The linear interpolation formula we use for finding cf_X is presented below:



    (X - X_LL) / i = (cf_X - cf_LL) / f_w

Where X is the score in question
      X_LL is the lower limit of X's interval
      i is the length of the score interval for X
      cf_X is the cumulative frequency of X (what we are actually trying to find)
      cf_LL is the cumulative frequency of the lower limit of X
      f_w is the frequency of scores associated with the interval containing X

At this juncture, all of this may seem extremely confusing, so let's backtrack a bit before working on the equation. Our objective, you will recall, involves the computation of percentile ranks. We know that:

1. The formula for computing percentile ranks for given scores (that is, X's) is cf_X / N x 100. Thus, if we know the values of cf_X and N, we can compute the percentile rank.

2. We should have no difficulty determining N, because N refers to the number of scores (observations, X's, etc.) in our frequency distribution.

3. We may have a problem, however, in determining cf_X because our frequency distribution only tells us the cumulative frequency of the upper limits for the various score intervals. Looking at the example frequency distribution (see above) we know that the cf_X for a score of 7.5 is 11; we know that the cf_X for a score of 9.5 is 20, etc. But we cannot be sure about the cf_X for "midpoint" scores such as 4, 5, 6, 7, 8, 9, and 10.

4. Therefore, in order to estimate cf_X, we must go through a process called linear interpolation. The particular equation we use is:

    (X - X_LL) / i = (cf_X - cf_LL) / f_w

All of the variables in this equation are defined above. Notice that one of the variables is cf_X. Thus, if we can substitute values for all of the other variables we should be able to determine cf_X for any given score.

5. Once we find cf_X, we divide it by N (see step 1), multiply by 100, and we get our percentile rank.

Let's go back to our original problem: find the percentile rank for a score of 7. Check back to the example frequency distribution.

1. The equation we use is cf_X / N x 100.

2. N = 22, which is the number of scores in our distribution. Therefore, Percentile Rank = cf_X / 22 x 100.

3. cf_X for a score of 6.5 is 9; cf_X for a score of 7.5 is 11. We have no direct indication of the value of cf_X for a score of 7 (which is what we need).

4. We must interpolate. We begin by writing down the equation and substituting values. Remember, we are looking for cf_X.

   a) (X - X_LL) / i = (cf_X - cf_LL) / f_w

   b) X = the score in question
      therefore, X = 7
      therefore, (7 - X_LL) / i = (cf_X - cf_LL) / f_w

   c) X_LL = the lower limit of X's interval
      therefore, X_LL = 6.5
      therefore, (7 - 6.5) / i = (cf_X - cf_LL) / f_w

   d) i = the length of the score interval for X
      therefore, i = 1 (7.5 - 6.5)
      therefore, (7 - 6.5) / 1 = (cf_X - cf_LL) / f_w

   e) cf_LL = the cumulative frequency of the lower limit of X
      therefore, cf_LL = 9 (lower limit of 7)
      therefore, (7 - 6.5) / 1 = (cf_X - 9) / f_w

   f) f_w = the frequency of scores within X's interval
      therefore, f_w = 2 (2 scores fall between 6.5 - 7.5)
      therefore, (7 - 6.5) / 1 = (cf_X - 9) / 2

   g) Work through equation:
      .5 = (cf_X - 9) / 2
      1.0 = cf_X - 9
      cf_X = 10

5. Determine percentile rank:
   cf_X / N x 100 = 10/22 x 100
                  = .45 x 100
                  = P45

Try to believe that all we did in using the interpolation procedure was:

a) determine that a score of 7 was half the distance between 6.5 and 7.5, the real limits of the interval in question.

b) conclude that therefore, half of the two scores associated with the interval 6.5 - 7.5 should fall below the score value, 7. Half of 2 is 1 (hopefully, you agree with this).

c) add the one score to the cf for the lower limit of the interval, ending up with cf_X = 9 + 1 = 10.

The above description is intended to show that there is nothing "magic" about the interpolation procedure. Examine the interpolation equation and see if you can understand its logic. If you can, you will probably experience little difficulty with this unit.
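If it helps to see the interpolation equation solved for cf_X once and for all, here is a minimal Python sketch. The function and parameter names are invented for this illustration; the rearrangement cf_X = cf_LL + (X - X_LL) / i x f_w is simply the equation above with cf_X isolated.

```python
def interpolated_cf(x, lower_limit, width, cf_lower, f_within):
    """Estimate cf for score x by linear interpolation.

    lower_limit: lower real limit of x's interval (X_LL)
    width:       length of the interval (i)
    cf_lower:    cumulative frequency at the lower limit (cf_LL)
    f_within:    frequency of scores within the interval (f_w)
    """
    # Fraction of the interval lying below x, applied to the f_w scores.
    return cf_lower + (x - lower_limit) / width * f_within


def percentile_rank(x, lower_limit, width, cf_lower, f_within, n):
    """Percentile rank = interpolated cf divided by N, times 100."""
    return interpolated_cf(x, lower_limit, width, cf_lower, f_within) / n * 100


# The worried friend's score of 7: interval 6.5 - 7.5, cf_LL = 9, f_w = 2, N = 22.
cf_7 = interpolated_cf(7, 6.5, 1, 9, 2)
pr_7 = percentile_rank(7, 6.5, 1, 9, 2, 22)
print(f"cf = {cf_7}, P.R. = P{round(pr_7)}")
```

Steps a) through c) above are exactly what the single line inside `interpolated_cf` does: half of the interval lies below 7, so half of the 2 scores are added to 9.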






Compute the percentile rank for a score of 9.

1. P.R. = cf_X / N x 100

2. N = 22

3. cf = 17 for 8.5, the lower limit of the score interval, 9.

4. (X - X_LL) / i = (cf_X - cf_LL) / f_w

   X = 9          cf_X = ?
   X_LL = 8.5     cf_LL = 17
   i = 1          f_w = 3

   Substituting:

   (9 - 8.5) / 1 = (cf_X - 17) / 3

   1.5 = cf_X - 17   (Note: We subtracted 8.5 from 9, and then cross-multiplied the 3 to get 1.5 on the left.)
   cf_X = 18.5

5. P.R. = 18.5/22 x 100
   P.R. = 84
   Answer = P84

Notice that all we did by interpolating was:

a) Find that 9 was 1/2 the distance between 8.5 and 9.5.

b) Determine 1/2 the scores associated with the interval, 8.5 - 9.5 (which was found to be 1.5).

c) Add 1.5 to 17, the cf of the lower limit, ending up with cf_X = 18.5.

Example problem: The exact same procedures are used in determining percentile ranks for grouped data. Using the data provided below, compute the percentile rank for a score of 16.

    X         f     cf
    21-23     5    100
    18-20     8     95
    15-17    20     87
    12-14     6     67
    9-11     10     61
    6-8      25     51
    3-5      13     26
    0-2      13     13



1. P.R. = cf_X / N x 100

2. N = 100

3. cf = 67 for the value 14.5, the lower limit of the interval into which 16 falls.

4. (X - X_LL) / i = (cf_X - cf_LL) / f_w

   X = 16 (the score in question)                          cf_X = ?
   X_LL = 14.5                                             cf_LL = 67
   i = 3 (notice that the interval 14.5 - 17.5 is 3 units) f_w = 20 (notice that f = 20 for the interval 14.5 - 17.5)

   Substituting: (16 - 14.5) / 3 = (cf_X - 67) / 20

   30/3 = cf_X - 67
   cf_X = 77

5. P.R. = 77/100 x 100
   Answer is P77

Note that what was done can be illustrated as follows:

    X (score intervals)              cf (number "beaten")

    17.5 is associated with . . . .  87 people
    16.0 is associated with . . . .  ?
    14.5 is associated with . . . .  67 people

By interpolating we found that 16 was 1/2 of the distance between 14.5 and 17.5. We therefore took 1/2 of the scores in that interval (that is, 1/2 of 20) and added them to the cf for the lower limit (that is, 67). The result was a cf_X of 77.






Example problem: Using the same data, determine the P.R. for a score of 6.

1. P.R. = cf_X / N x 100

2. N = 100

3. cf = 26 for the lower limit of the interval associated with 6.

4. (X - X_LL) / i = (cf_X - cf_LL) / f_w

   X = 6          cf_X = ?
   X_LL = 5.5     cf_LL = 26
   i = 3          f_w = 25

   Substituting:

   (6 - 5.5) / 3 = (cf_X - 26) / 25

   4.17 = cf_X - 26
   cf_X = 30.17

5. P.R. = 30.17/100 x 100
   P.R. = 30.17
   Answer: P30

An illustration for this one would look like:

    X                               cf (number "beaten")

    8.5 is associated with . . . .  51 people
    6.0 is associated with . . . .  ?
    5.5 is associated with . . . .  26 people

Since the score of "6" is closer to the lower, than to the upper, limit of the interval, the interpolation process will show its cf to be closer to the lower limit's cf of 26. If you understand this, you have come a long way. Take a two-minute water break as a reward (more exciting things may be promised later).

More specifically, this time by interpolating, we found that a score of 6 was 1/6 of the distance between 5.5 and 8.5. We therefore took 1/6 of the scores in that interval (that is, 1/6 of 25) and added them to the cf for the lower limit (that is, 26). The result was cf_X = 30.17.
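For grouped data, the whole procedure (find the interval, look up cf_LL and f_w, interpolate, divide by N) can be sketched as one Python function. Everything about this sketch is illustrative: the name `percentile_rank_grouped` is invented, the intervals are assumed to have whole-number limits (so the real limits lie 0.5 beyond them), and the top interval 21-23 is assumed to hold the 5 scores needed to bring N to 100.

```python
def percentile_rank_grouped(score, rows):
    """Percentile rank of a score from a grouped frequency table.

    rows: list of (low, high, f) tuples, lowest interval first,
          with whole-number score limits.
    """
    n = sum(f for _, _, f in rows)
    cf_lower = 0  # cumulative frequency at the current lower real limit
    for low, high, f in rows:
        lower_limit, upper_limit = low - 0.5, high + 0.5
        if lower_limit <= score <= upper_limit:
            width = upper_limit - lower_limit
            cf_x = cf_lower + (score - lower_limit) / width * f
            return cf_x / n * 100
        cf_lower += f
    raise ValueError("score lies outside the table")


# The grouped distribution used in the two examples above (N = 100).
rows = [(0, 2, 13), (3, 5, 13), (6, 8, 25), (9, 11, 10),
        (12, 14, 6), (15, 17, 20), (18, 20, 8), (21, 23, 5)]

pr_16 = percentile_rank_grouped(16, rows)
pr_6 = percentile_rank_grouped(6, rows)
print(f"score 16: P{round(pr_16)}   score 6: P{round(pr_6)}")
```

A score of 16 sits halfway through its interval and picks up half of the 20 scores; a score of 6 sits one-sixth of the way through and picks up only one-sixth of the 25.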






2.2 POSTTEST (answers in back)

For each of the following sets of data, compute percentile ranks for the raw scores indicated.

    Set 1                        Set 2

    X      f     cf              X         f     cf
    50     8     38              18-20     2     25
    49     9     30              15-17     4     23
    48     7     21              12-14     8     19
    47     4     14              9-11      3     11
    46     3     10              6-8       3      8
    45     3      7              3-5       4      5
    44     2      4              0-2       1      1
    43     1      2
    42     1      1

    a) score of 49               a) score of 11
    b) score of 47               b) score of 9
    c) score of 44               c) score of 7


2.3 Instructional Unit: Converting Percentile Ranks into Raw Scores

In the preceding unit, we learned how to convert raw scores into percentile ranks. That is, if a student told us what score he earned on a test, we would be able to use the frequency data for that test to compute his percentile rank (and thereby, tell him the percentage of classmates that he "beat"). But suppose that, after a test or measure has been administered, we are asked to indicate which scores would correspond to various percentile ranks, such as P25, P47, P66, P90. For example, a friend (or even an enemy) might not know his score on an exam, but be interested in knowing what score he would need in order to surpass two-thirds of the students in his class. In other words, he is asking, "What score corresponds to P67?" If we had the frequency data for his examination, finding the raw score equivalent for a given percentile rank is fairly easy. In fact, it simply involves the "opposite" process from finding the percentile rank for a given raw score. Suppose the test results are as follows:



    X      f     cf
    10     5     50
    9      3     45
    8      7     42
    7      8     35
    6      4     27
    5      8     23
    4      6     15
    3      4      9
    2      3      5
    1      2      2



We are interested, remember, in determining the raw score equivalent for P67.

1. First, put down the only basic formula you know of that relates percentile ranks to scores in the distribution: P.R. = cf_X / N x 100. Then put down the interpolation formula: (X - X_LL) / i = (cf_X - cf_LL) / f_w

   Keep in mind that we are looking for the value of "X" this time. In the last unit we were trying to compute the value of P.R. Since we are looking for X, the interpolation formula seems like a good place to start: it includes X in the equation, and if we can substitute values for all of the other variables, finding X should be a relatively simple endeavor.

2. Although we would like to start substituting values in the interpolation equation, we really cannot get very far. We don't know the value of X and therefore cannot possibly determine values for X_LL, cf_LL, f_w, etc. It appears that we are at a standstill, but there is something we can do which will enable things to fall into place; we can determine the value of cf_X. How? By use of the original equation: P.R. = cf_X / N x 100

   so, we substitute: P.R. = 67 (that was given)
                      N = 50 (the number of scores)

   therefore: 67/100 = cf_X / 50
              .67 = cf_X / 50
              33.5 = cf_X

   for convenience (this time only) we will round off and say that cf_X = 34.






3. Now we are ready to try the interpolation equation again. If we can substitute values for all variables, we can easily compute X.

   (X - X_LL) / i = (cf_X - cf_LL) / f_w

   X = ?         (This is what we are looking for.)

   cf_X = 34     We determined this in step #2; we now know that X is a score that has a cumulative frequency of 34. Looking at the distribution we can further conclude that X will fall within the interval 6.5 (cf = 27) to 7.5 (cf = 35). It appears that it will be closer to 7.5, but how close?

   X_LL = 6.5    How do we know this? Since cf_X = 34, X must fall between 6.5 and 7.5. The lower limit of X's interval is therefore 6.5.

   i = 1         The score interval associated with X (6.5 - 7.5) is 1 unit in width.

   cf_LL = 27    The cumulative frequency of the lower limit of the interval (6.5) is 27.

   f_w = 8       The frequency of scores in the interval, 6.5 - 7.5, is 8.

   It looks like we got everything. Now let's find X:

   (X - 6.5) / 1 = (34 - 27) / 8
   X - 6.5 = 7/8
   X = 6.5 + 7/8
   X = 7.38

4. Our answer: P67 = 7.38

Example problem: Using the same frequency distribution, find P25.

We are now looking for the score associated with a P.R. of 25:

1. Put down the basic P.R. equation; put down the interpolation equation.

2. Find cf_X by use of the basic P.R. equation.

   P.R. = cf_X / N x 100
   25 = cf_X / 50 x 100
   .25 = cf_X / 50
   cf_X = 12.5 (this time we won't round off)

3. Use the interpolation equation to find X.

   (X - X_LL) / i = (cf_X - cf_LL) / f_w

   X = ?
   cf_X = 12.5 (see step #2)
   X_LL = 3.5 (if cf_X = 12.5, X must be a score between 3.5 and 4.5)
   i = 1 (the interval, 3.5 - 4.5, is 1 unit in width)
   cf_LL = 9 (the lower limit, 3.5, has a cf of 9)
   f_w = 6 (6 scores fall between 3.5 - 4.5)

   substituting and working through:

   (X - 3.5) / 1 = (12.5 - 9) / 6
   X - 3.5 = 3.5/6
   X = 4.08

4. Our answer: P25 = 4.08



Example Problem : The procedures are identical when we are 

dealing with grouped data. 

Given the following data, find Pqq. ' 



X       f    cf
75-79   2    65
70-74   9    63
65-69   15   54
60-64   11   39
55-59   10   28
50-54   7    18
45-49   8    11
40-44   3    3



Write down the two equations you will need.
Find cf_X.

P.R. = cf_X/N x 100
.80 = cf_X/65
cf_X = 52

Use the interpolation equation to find X.

(X - X_LL) / i = (cf_X - cf_LL) / f_w



cf_X = 52

X_LL = 64.5 (if cf_X = 52, X must be a score between
64.5 - 69.5)

i = 5 (the interval 64.5 - 69.5 is 5 units in width)

cf_LL = 39 (the lower limit, 64.5, has a cf of 39)

f_w = 15 (15 scores fall between 64.5 - 69.5)

Substituting and working through:

(X - 64.5) / 5 = (52 - 39) / 15

X = 68.83

Our answer: P80 = 68.83
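
The two equations used in the examples above can be written as a short Python sketch. The function and parameter names are illustrative only and do not come from the text; the arithmetic is simply the P.R. equation solved for cf_X and the interpolation equation solved for X.

```python
def cumulative_frequency_for_rank(percentile_rank, n):
    """Solve P.R. = cf_X / N x 100 for cf_X."""
    return percentile_rank * n / 100


def score_at_cumulative_frequency(x_ll, i, cf_ll, f_w, cf_x):
    """Solve (X - X_LL) / i = (cf_X - cf_LL) / f_w for X."""
    return x_ll + i * (cf_x - cf_ll) / f_w


# P25 in the ungrouped example (N = 50, interval 3.5 - 4.5)
cf_x = cumulative_frequency_for_rank(25, 50)
p25 = score_at_cumulative_frequency(x_ll=3.5, i=1, cf_ll=9, f_w=6, cf_x=cf_x)
print(f"{p25:.2f}")  # 4.08

# P80 in the grouped example (N = 65, interval 64.5 - 69.5)
cf_x = cumulative_frequency_for_rank(80, 65)
p80 = score_at_cumulative_frequency(x_ll=64.5, i=5, cf_ll=39, f_w=15, cf_x=cf_x)
print(f"{p80:.2f}")  # 68.83
```

The values of X_LL, cf_LL, and f_w still have to be read off the distribution by inspection, exactly as in the worked examples.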



2.3 POSTTEST (answers in back)

For the following sets of data find P30, P^g, and P^q.



SET 1                    SET 2

X       f    cf          X    f    cf
                         47   1    39
15-17   5    31          46   3    38
12-14   6    26          45   4    35
9-11    10   20          44   10   31
6-8     4    10          43   8    21
3-5     3    6           42   6    13
0-2     3    3           41   5    7
                         40   2    2











2.4 Instructional Unit: Central Tendency




Percentile ranks give us an indication of how an
individual performed relative to others. Aside from
being interested in individual performance, however,
we are usually concerned with how the group did as a
whole. Frequency distributions, of course, do supply
us with information concerning group performances, but
this information is difficult to communicate to our
readers. A good way to precisely summarize group
achievement would be to determine the score that is
most typical or representative of all scores in our
frequency distribution. We call these typical or
representative scores measures of central tendency.

A measure of central tendency is one of several
types of averages. An average is a score that is
typical or representative of a group of scores.
Three of the most commonly employed central tendency
measures are the mode (most frequently occurring score),
the median (the middle score), and the mean (the
"average" score). Most importantly, though, any measure
of central tendency is supposed to indicate a
"representative" score value for the group being evaluated.

2.4 POSTTEST (answer is provided above - see
if you can define it in your own words)

Define Central Tendency






Instructional Unit: Computation of the Mean



The most commonly employed measure of central tendency
is the mean, symbolized X̄. It is obtained by adding
together all the numerical values and dividing by N, the
number of values. You have probably used this procedure
many times in computing your average exam score in high
school and college classes. By definition, the mean is
the number where the deviations above it equal the deviations
below it. That is, if you subtracted X̄ from every number above
and below X̄ and added the deviations together, the two sums of
deviations would be equal.

The basic formula for the mean is:

X̄ = ΣX / N (if you are uncertain about "summation"
notation, check back to the review on arithmetic
operations)

All the formula means is: Add up all your scores; then
divide by the number of scores.

Example Problem: Given the following scores, determine
the mean.

Scores: 10, 8, 7, 6, 11, 20, 5

X̄ = ΣX / N

ΣX = 67

N = 7

X̄ = 67/7

X̄ = 9.57 (two decimal accuracy is sufficient)
Example Problem: Compute the mean of the following scores:

X    f    cf
25   13   30
24   0    17
23   6    17
22   3    11
21   5    8
20   2    3
19   1    1






Again, the basic formula is X̄ = ΣX / N

All you have to do is add the scores and divide by N
(which you know to be 30). No tricks are involved, but
a very common mistake would be to start adding the
scores as follows: 19 + 20 + 21 + 22 + 23, etc. Why
would this be a mistake? After all, we are adding the
scores to find ΣX. Maybe an important clue would be
provided if it was pointed out that the proper adding
procedure would be something like this: 19 + 20 + 20
+ 21 + 21 + 21 + 21 + 21 + 22, etc. Now, you probably
see the light. The "X" column indicates (as always)
the possible scores, but the "f" column tells us how
many of those scores were actually obtained. It looks
as though we will end up with a rather long addition
problem in order to determine the correct value of ΣX.
Fortunately, however, there is an easier procedure which
is clearly appropriate when we have more than one score
of the same value. The procedure (which you should
definitely learn and try to understand) involves:

1. Multiplying f times the midpoint of each interval.
In the above ungrouped distribution, the midpoints
are simply the values listed: 19, 20, 21, 22,
etc. (remember, the value 22 is the midpoint of the
interval 21.5 - 22.5). Multiplying X x f accounts
for all the numerical values in the interval.
Thus, for each score point, we will end up with an
additional column indicating the value of fX.

2. Sum up all the fX's and divide by N. The result
will be your mean.

3. Thus, whenever we have a frequency distribution,
a more manageable formula for computing the mean
is: X̄ = ΣfX / N

Let's work through the formula step-by-step.
Our frequency distribution:

(note that to facilitate matters,
we have added an "fX" column,
which shows the products of the
various scores times their
frequencies. At the bottom,
we add all the fX's, resulting
in ΣfX.)

X    f    cf   fX
25   13   30   325
24   0    17   0
23   6    17   138
22   3    11   66
21   5    8    105
20   2    3    40
19   1    1    19

          ΣfX = 693






Now all we do is work the basic formula X̄ = ΣfX / N

X̄ = 693/30
X̄ = 23.1

Note: the formula ΣfX / N is really the same as ΣX / N. When we
have many values of the same score, ΣfX facilitates the
computation process.
Example Problem: Compute the mean of the following data:

X       f    X'   fX'
24-26   1    25   25
21-23   2    22   44
18-20   8    19   152
15-17   4    16   64
12-14   6    13   78
9-11    3    10   30
6-8     1    7    7

ΣfX' = 400

N = 25

X̄ = ΣfX' / N

X̄ = 400/25 = 16

The above problem may appear, at first, to be
somewhat complicated, but the procedures are really quite
simple. Since we are dealing with a frequency distribution,
the formula needed for computing the mean is
ΣfX / N. It would take too long if we decided to add every
score individually, so we multiply each score by its
frequency, and add the fX's to yield our ΣfX (or ΣX).

The only difference between the above problem and
the previous problem is that the present data are grouped.
Thus, we need a "representative" score for each interval.
The most representative score anyone could choose would
naturally be the interval midpoint. To avoid confusion,
the interval midpoints have been labeled as X' and
included on the table.






In summary, in a grouped frequency distribution, the
midpoint of the interval (symbolized X') becomes the X
score which, in turn, must be multiplied by the
corresponding f value. Since the f value indicates how
many numbers are in the interval, multiplying f times the
midpoint (X') accounts for all numerical values in the
interval. Therefore, the formula for X̄ using grouped
data would be:

X̄ = ΣfX' / N = 400/25 = 16

This procedure is really identical to that employed in the
previous problems; the only change involves locating the
midpoint of each interval. Note that the midpoint of the
score interval 15-17 is listed as 16. This makes sense
since 16 falls exactly halfway between 14.5 and 17.5, the
real limits of the interval.

Concluding note: The important thing is that regardless
of whether ungrouped or grouped data are being used, all
numbers (all f) must be accounted for. Either you add
them individually (many times an impractical endeavor)
or multiply "X" by the appropriate f value and then add
up the fX's.
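
As an illustration of the ΣfX / N procedure, here is a minimal Python sketch that reproduces both worked examples. The function names are hypothetical and chosen for this illustration; for grouped data the interval midpoint X' stands in for X, as the text describes.

```python
def mean_from_frequencies(scores, frequencies):
    """Return (sum of f * X) / N for a frequency distribution."""
    n = sum(frequencies)
    sum_fx = sum(f * x for x, f in zip(scores, frequencies))
    return sum_fx / n


def midpoint(low, high):
    """Midpoint X' of a score interval such as 15-17."""
    return (low + high) / 2


# Ungrouped example: scores 25 down to 19
scores = [25, 24, 23, 22, 21, 20, 19]
f = [13, 0, 6, 3, 5, 2, 1]
print(mean_from_frequencies(scores, f))  # 23.1

# Grouped example: each interval is represented by its midpoint
intervals = [(24, 26), (21, 23), (18, 20), (15, 17), (12, 14), (9, 11), (6, 8)]
f = [1, 2, 8, 4, 6, 3, 1]
midpoints = [midpoint(low, high) for low, high in intervals]
print(mean_from_frequencies(midpoints, f))  # 16.0
```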

POSTTEST (answers in back)
1. Define Mean

2. For each of the following sets of data, compute the
mean. Round off to two decimal places if necessary.
(cf has been left out of the tables, since it is
really not needed; you will need to add f's to
compute N, however)



SET 1

X       f
18-20   10
15-17   12
12-14   4
9-11    3
6-8     1



SET 2 


1 


V 


f 


QS 
7^ 


1 




1 




1 


ft? 


1 


84 


1 


76 


1 


59 


1 


45 


1 



SET 3

X    f
10   3
9    4
8    7
7    5
6    2
5    1

Instructional Unit: Computation of the Median

The median, another measure of central tendency, is
the number that corresponds to the middle frequency (that
is, the middle score) in a ranked set of data. The median
is the value that divides your distribution in half; half
of your scores will be higher than the median, and half
will be lower than the median.

X            f
High
             50%
Median  -------
             50%
Low

It is important to remember that the median is the halfway
point in the distribution in terms of frequencies (it
corresponds to N/2). For example, if N = 40 (meaning that
you have 40 scores), the median will be your 20th score
(in terms of rank); if N = 17, the median will be your
8.5th highest score, etc.

Another way of defining the median is to say that it
corresponds to P50. The median is the score
that corresponds to a percentile rank of 50; it is higher
than 1/2 the scores, and lower than 1/2 the scores. If
you have mastered Unit 2.3 (converting percentile ranks
into raw scores) then computing medians cannot possibly
present any difficulty. All you do, given a group of scores
or a frequency distribution, is find the raw score for P50.
The procedures you mastered in Unit 2.3 are fully
applicable for computing medians.



In order to compute medians, use the following guidelines:

1. If you have an odd number of scores, with no repeats
(all f's = 1), the P50 interpolation formula is not
needed. All that is required to find the median is to
rank order your scores from lowest to highest; the middle
score will be your median.

Example Problem: Find the median (Md) in the following
set of data: 26, 24, 23, 28, 31.

Rank order the scores: X

31
28
26
24
23

What is the middle score?

Ans. Md = 26

2. If you have an even number of scores, with no repeats, the
Md is 1/2 the distance between the two middle scores.
Thus, without worrying about interpolation, rank order the
scores from low to high, find the two middle ones, and
then determine the point halfway between them. The latter
step simply involves averaging the two.

Example Problems: Find the Md in each of the following
sets of data:

SET A: 13, 92, 68, 79, 81, 27
SET B: 205, 198, 146, 261, 190, 105, 149, 269
Rank order the scores:

SET A    SET B
92       269
81       261
79       205
68       198
27       190
13       149
         146
         105

What are the two middle scores?

Ans. 68 and 79 in SET A; and 190 and 198 in SET B

Md = (68 + 79) / 2 in SET A; and (190 + 198) / 2 in SET B

Md = 73.5 in SET A; and 194 in SET B

3. If you have a large set of data, and/or one in which
frequencies are greater than 1, do not use the "short-cut"
procedures described above. What you should do
is treat the median as P50 (which it always is) and
follow the identical procedures described in Unit 2.3
for converting percentile ranks into raw scores.
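
The two short-cut rules in guidelines 1 and 2 can be sketched in Python as follows. The function name is hypothetical, and the check for repeated scores is an assumption added here to mirror the condition in guideline 3 that the short-cut should not be used when any frequency exceeds 1.

```python
def shortcut_median(scores):
    """Median by rank ordering; assumes no repeated scores (all f = 1)."""
    if len(set(scores)) != len(scores):
        raise ValueError("repeated scores: treat the median as P50 instead")
    ranked = sorted(scores)
    n = len(ranked)
    middle = n // 2
    if n % 2 == 1:
        # Odd N: the middle score is the median
        return ranked[middle]
    # Even N: average the two middle scores
    return (ranked[middle - 1] + ranked[middle]) / 2


print(shortcut_median([26, 24, 23, 28, 31]))                      # 26
print(shortcut_median([13, 92, 68, 79, 81, 27]))                  # 73.5
print(shortcut_median([205, 198, 146, 261, 190, 105, 149, 269]))  # 194.0
```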







Example Problem: Find the median for the following set of data:

X    f    cf
56   3    20
55   2    17
54   1    15
53   2    14
52   5    12
51   4    7
50   3    3



We are looking for the median - that is, P50

a. Put down the basic P.R. equation; put down the
interpolation equation

b. Find the cf_X associated with P50 by use of the
P.R. equation

P.R. = cf_X/N x 100

50 = cf_X/20 x 100

.50 = cf_X/20

cf_X = 10 (We have now determined that the median will
be the 10th score in the distribution)

c. Use the interpolation equation to determine the
median (X)

(X - X_LL) / i = (cf_X - cf_LL) / f_w

X = ?

cf_X = 10 (see step b)






X_LL = 51.5 (if cf_X = 10, X must be a score between
51.5 - 52.5)

i = 1 (the interval 51.5 - 52.5 is 1 unit in width)
cf_LL = 7 (the lower limit 51.5 has a cf of 7)
f_w = 5 (5 scores fall between 51.5 - 52.5)

Substituting and working through:

(Median (or X) - 51.5) / 1 = (10 - 7) / 5

Median - 51.5 = 3/5

Median = 52.1

d. Our answer: P50 (or Median) = 52.1

The score 52.1 is the dividing point in our
distribution.

Example Problem: Compute the median for the following
set of data (this problem will be
worked without as much description;
if you are having trouble with
procedures, reread Unit 2.3)

X       f    cf
72-74   6    30
69-71   10   24
66-68   5    14
63-65   6    9
60-62   3    3


cf_X associated with P50 = 15

(X - X_LL) / i = (cf_X - cf_LL) / f_w

(X - 68.5) / 3 = (15 - 14) / 10

X - 68.5 = 3/10
X - 68.5 = .3
X = 68.8
Answer: Median = 68.8
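
Treating the median as P50 can also be automated. The following Python sketch is one possible arrangement, not the text's own procedure: it assumes the intervals are supplied lowest first, by their real lower limits, and that all intervals share the same width i. The function name is hypothetical.

```python
def median_from_table(lower_limits, width, frequencies):
    """Median as P50, by interpolation within the interval containing cf_X.

    lower_limits: real lower limit of each interval, lowest interval first
    width:        interval width i (assumed equal for all intervals)
    frequencies:  f for each interval, in the same order
    """
    n = sum(frequencies)
    cf_x = n / 2        # from .50 = cf_X / N
    cf_ll = 0           # cumulative frequency below the current interval
    for x_ll, f_w in zip(lower_limits, frequencies):
        if f_w > 0 and cf_ll + f_w >= cf_x:
            return x_ll + width * (cf_x - cf_ll) / f_w
        cf_ll += f_w
    raise ValueError("the distribution contains no scores")


# Ungrouped example: scores 50 through 56
md = median_from_table([49.5, 50.5, 51.5, 52.5, 53.5, 54.5, 55.5], 1,
                       [3, 4, 5, 2, 1, 2, 3])
print(round(md, 2))  # 52.1

# Grouped example: intervals 60-62 through 72-74
md = median_from_table([59.5, 62.5, 65.5, 68.5, 71.5], 3, [3, 6, 5, 10, 6])
print(round(md, 2))  # 68.8
```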






2.4.2 POSTTEST (answers in back)

1. Define the median

2. For each of the following sets of data, compute the
median.



Set 1    Set 2    Set 3             Set 4

X        X        X    f    cf      X       f    cf
12       68       12   2    17      30-32   2    24
10       65       11   4    15      27-29   7    22
9        62       10   3    11      24-26   6    15
8        60       9    6    8       21-23   4    9
r        59       8    1    2       18-20   2    5
         54       7    1    1       15-17   2    3
                                    12-14   1    1






Instructional Unit: Determination of the Mode

The third and last measure of central tendency you will
be asked to learn is the mode. The mode is the number that
occurs most frequently in a set of data. There is no
formula for the mode; it is obtained by simple inspection.
If there is more than one number that has the highest
frequency (i.e., the highest f value is shared by 2 or
more X values), then there are 2 modes; i.e., the
distribution is bimodal.



Example Problem: Identify the mode in the following set
of data

101
101
100
100
100
98
89

The mode is 100, since 100 occurs (three times) more
than any other number. In a sense, 100 is the most
"popular" score.

Example Problem: Identify the mode in the following
set of data

X    f
10   3
9    h
8    7
7    6
6    7
5    4
4    2

Here, both 6 and 8 are modes since
they occur seven times, the highest
frequency value. "6" and "8" were
the most "popular" scores.

Example Problem: Identify the mode in the following
set of data

X       f
20-22   /
17-19   (
14-16   19
11-13   18
8-10    24
5-7     16
2-4     Z

In a grouped frequency distribution,
look for the interval with the highest
f value and then use the midpoint of
that interval.

In the distribution above, the
modal interval is 8-10 (7.5 - 10.5)
since it has the highest f value (24).

Thus the mode is 9 (the interval midpoint).
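
Finding the mode "by inspection" amounts to looking for the largest frequency. A minimal Python sketch follows; the function names are hypothetical. The grouped demonstration reuses the midpoints and frequencies from the grouped-mean example earlier in this unit, and the bimodal list is made up for illustration.

```python
from collections import Counter


def modes_from_scores(scores):
    """Return every score that shares the highest frequency."""
    counts = Counter(scores)
    highest = max(counts.values())
    return sorted(x for x, f in counts.items() if f == highest)


def modes_from_table(scores, frequencies):
    """Same idea when the X and f columns are already tabulated.

    For grouped data, pass the interval midpoints as the scores.
    """
    highest = max(frequencies)
    return [x for x, f in zip(scores, frequencies) if f == highest]


print(modes_from_scores([101, 101, 100, 100, 100, 98, 89]))  # [100]

# Hypothetical bimodal data: two scores share the highest frequency
print(modes_from_scores([6, 6, 7, 8, 8]))  # [6, 8]

# Grouped data from the earlier mean example (midpoints of 24-26 ... 6-8)
print(modes_from_table([25, 22, 19, 16, 13, 10, 7], [1, 2, 8, 4, 6, 3, 1]))  # [19]
```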



POSTTEST (answers in back)
1. Define the mode

2. Identify the mode in each of the following sets of
data.

Set 1        Set 2           Set 3

X    f       X       f       X
64   1       40-42   14      7
63   3       37-39   6       7
62   4       34-36   3       7
61   2       31-33   2       6
60   6       28-30   4       6
59   4       25-27   1       6
58   6                       6
57   2                       4
                             4

Instructional Unit: Comparisons between Means, Medians,
and Modes in Skewed and Non-Skewed Distributions

Whenever possible, it is always desirable to report
all three measures of central tendency since they provide
different kinds of information:

1. the Mean is the score point at which the
distribution balances

2. the Median is the score point that bisects the
area

3. the Mode is the score point with the highest
frequency

In general, however, the mean provides the most useful
measure of central tendency. This is because the mean has
certain mathematical properties and can be used for a
variety of further calculations. The mode is the least
useful measure; it is not very sensitive to variations in
your distribution. The median is usually not preferred
relative to the mean, but there are two special cases where
it should be used: The first is a case where the mean
cannot be calculated; the second is a case where the median
actually gives a better indication of central tendency.



CASE 1 - Use the median if a score interval is open-ended

Salary            f
25,000-over       19
20,000-24,999     23
15,000-19,999     96
10,000-14,999     42
5,000-9,999       31

Ordinarily in a grouped distribution, the mean would be
calculated by multiplying the midpoint of each interval
(X') by the corresponding frequencies (f). However, in
the above distribution showing the frequency of various
salaries, the midpoint of the uppermost interval is
unknown. Therefore, we cannot find "fX'" for that
interval; the mean cannot be calculated. The median can,
however, by use of the P.R. and interpolation equations.

CASE 2 - Use the median when the group of scores contains
extreme values

What do we mean by extreme values? In simple terms, they
would encompass scores that are located a considerable
distance from where most of the scores seem to cluster.
Remember the concept of skewness discussed in the final
section of Unit I? To refresh your memory, we can have
a symmetrical distribution where there is no skewness,
a distribution with a positive skew, or a distribution
with a negative skew. As can be seen in the frequency
polygons below, the skew represents a "tail" of sorts,
encompassing scores that are relatively far from the
cluster at the opposite (high or low) end.

[Figure: frequency polygons illustrating skew]

Thus, the rule is that when there is a serious skew,
either positive or negative, a more representative measure
of central tendency can probably be found in the
median as opposed to the mean. In the same sense, given
a listing of scores that contain some extreme values,
the median is likewise the better choice. The frequency
polygons shown below give some idea of how serious,
as opposed to slight, skewing would be represented.




[Figure: two frequency polygons, one labeled (Serious) and one labeled (Slight)]



The distinction between serious and minimal skews should
be easy to memorize, but the serious (as opposed to the
minimal) statistics student would probably want to know
why extreme scores dictate the median. Here is an example:
five men (or women) visit a family practice clinic to
establish their eligibility for receiving health services.
The questionnaire that they are asked to fill out contains
a space for them to list their yearly family income. The
amounts listed are as follows: $10,000; $12,000; $13,000;
$15,000; and $400,000 (a former politician who was
convicted of accepting bribes and has written a best seller
describing his despair). Which would be the "better"
measure of central tendency, the mean or the median?

If we calculate the mean, we get: X̄ = ΣX / N = 450,000 / 5 = $90,000.

If we determine the median by selecting the middle score,
we get: $13,000.

Which would be the better measure for the clinic to base
its estimates of "typical" financial status on, the mean
or the median? The median clearly seems better in this
instance. Why? The data show a serious positive skew
(many "lows"; few "highs").
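
The clinic example is easy to check with Python's standard `statistics` module. This is only an illustrative sketch; the second computation, which leaves out the $400,000 income, is an addition made here to show how far one extreme value moves the mean.

```python
from statistics import mean, median

incomes = [10_000, 12_000, 13_000, 15_000, 400_000]

print(mean(incomes))    # 90000
print(median(incomes))  # 13000

# Illustration only: the same data without the one extreme value
typical = incomes[:-1]
print(mean(typical))    # 12500
```

With the extreme score removed, the mean falls from $90,000 to $12,500, which is close to the median of the full set.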

Here is another example. A teacher gives a math test to
students and gets the following results:

X       f    cf
90-99   3    12
80-89   4    9
70-79   3    5
60-69   0    2
50-59   0    2
40-49   0    2
30-39   0    2
20-29   0    2
10-19   0    2
0-9     2    2






He is initially very pleased, since as he records the
grades in his book he notes that nearly all of his 12
students did quite well. He expects a high class average
(mean), but when he computes it by the formula X̄ = ΣfX / N
he gets 854/12 = 71.17, a very disappointing result as he had
used his very best teaching techniques in presenting the
material. In his despair, he considers the possibility of
giving up teaching and enrolling in truck-driving school,
but then he notices the two zero scores recorded for the
test. This explains the snoring he heard during the exam
period. As a former statistics student, he calculates the
median through the formulas we have discussed:

P.R. = cf_X/N x 100

.50 = cf_X/12

6 = cf_X

through interpolation:

(Md - 79.5) / 10 = (6 - 5) / 4

Md - 79.5 = 1/4 x 10

Md = 82

Due to the extreme scores, the median in this case (Md = 82)
gives a better picture of the typical student performance
than does the mean (X̄ = 71.17). Knowing this, the math
teacher rests easy and even considers signing up to teach
school.

Before leaving this section, we should point out how the
mode may often be the most misleading central tendency
statistic of all. Suppose we wish to assess how much money
students carry around with them. We question the first five
students that we meet and find that they have, respectively:
$.05 (a measly nickel), $.05, $2.00, $3.85, and $4.05. In
this instance, both the mean and median would be $2.00
(can you determine why?), while the mode would be 5¢, hardly
"representative" but still the most frequent score for the
group. See the problem?
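
For readers who want to check the pocket-money example, here is a brief Python sketch using the standard `statistics` module. The amounts are entered in cents (an arrangement chosen here) so that the arithmetic stays in whole numbers.

```python
from statistics import mean, median, mode

# $.05, $.05, $2.00, $3.85, $4.05 expressed in cents
cents = [5, 5, 200, 385, 405]

print(mean(cents))    # 200 -> $2.00
print(median(cents))  # 200 -> $2.00
print(mode(cents))    # 5   -> 5 cents
```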

Concluding Note: The mean is usually the best measure of
central tendency to use except when the distribution is
open-ended or has extreme scores (i.e., is seriously skewed).






The skewness in a distribution affects whether the mean and the
median are approximately the same or different in that
distribution. If the frequencies bunch up toward the middle of the
score scale or are fairly evenly spread out along the score
scale, then there is little or zero skew. In a zero skew, the
mean and median are the same.




[Figure: symmetrical distribution with X̄ and Md at the same point]

Note: X̄ and Md are both the same!



/ 



6.3 
6.2 
6.1 

6.» 

5.9 
5.8 



2 
3 

4 _ 

4    X̄ = Median

4 

3 
2 






If there is a positive skew (frequencies bunch up at low
end), the mean tends to shift toward the trailing off end
(positive) of the distribution. However, the median
remains relatively unchanged. Therefore, in a positively
skewed distribution, the mean is larger than the median.

ΣX = 14

X̄ = 14/5 = 2.8

Median = 1 (approximate)
X̄ > Median



If there is a negative skew (frequencies bunch up at high
end), the mean tends again to shift toward the trailing
off end (negative) of the distribution. In this case, the
mean moves toward the lower end of the score scale and
will be smaller than the median.

ΣX = 41

Median = 10 (approximate)
X̄ < Median

Concluding Note: If a distribution is skewed, the mean is pulled
toward the trailing off end, that is, the end opposite from where
the bunching up occurs. Positive Skew: X̄ > Median. Negative
Skew: X̄ < Median.
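
The rule in the concluding note can be turned into a small Python check. This is a sketch of the text's mean-versus-median comparison only, not a formal skewness statistic. The function name is hypothetical, and the score lists are made up here so that their sums and medians match the figures quoted above (ΣX = 14 with Median = 1, and ΣX = 41 with Median = 10); the original plotted scores are not recoverable.

```python
from statistics import mean, median


def skew_direction(scores):
    """Compare mean and median as the text's rough indicator of skew."""
    x_bar, md = mean(scores), median(scores)
    if x_bar > md:
        return "positive"   # mean pulled toward the high tail
    if x_bar < md:
        return "negative"   # mean pulled toward the low tail
    return "none"


# Hypothetical scores chosen to match the quoted sums and medians
print(skew_direction([1, 1, 1, 2, 9]))       # positive (mean 2.8, median 1)
print(skew_direction([1, 10, 10, 10, 10]))   # negative (mean 8.2, median 10)
print(skew_direction([1, 2, 3, 4, 5]))       # none
```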






POSTTEST (answers in back)

1. For each of the following, indicate whether the mean
or the median constitutes the more appropriate measure
of central tendency.

A. X    f       B. X    f       C. X       f
   10   1          30   1          50-up   7
   9    2          27   2          40-49   8
   8    1          24   3          30-39   11
   7    2          21   5          20-29   8
   6    9          18   4          10-19   7
   5    12         15   2
   4    17         12   1

A. 



2. For each of the following distributions, select
the number of the relative position of the mean
and median.

C. (Assume this is perfectly bell-shaped)




Mean # 

Median # 

D. X 

10 
9 
8 

7 



B. 






- # 






# 


# 




E. X 


F. X 


. 1 


30 


65 


2 


30 


4 


5 


30 




t 


29 




1 


4 


4 



(a) X̄ = Md; (b) X̄ > Md; or (c) X̄ < Md.






(answers in back)

1. Listed below are the scores obtained on a history test:



X       f    cf
jU
85-89   16   29
80-84   10   13
75-79   2    3
70-74   0    1
65-69   1    1


a. What is Peo?

b. What is P25?

c. What is the P.R. for a score of 91?
d. What is the P.R. for a score of 80?

e. What is the mean?

f. What is the median?
g. What is the mode?

h. Which is probably the more meaningful measure of central
tendency for this distribution, the mean or the median?

Why?



2. For each of the following items match the description to one
of the following:

A. mean
B. median
C. mode

a. the point of balance in a distribution.

b. the most frequently obtained score.

c. generally the most useful measure of central tendency.

d. the halfway point in a distribution.

e. generally the least reliable measure of central tendency.
f. the numerical average.

g. P50



3. Answer each of the following with either A. mean or B. median.

a. is preferable when the distribution is symmetrical (not
seriously skewed)

b. must be used when the distribution is open-ended.

c. is larger when the distribution is positively skewed.

d. is larger when the distribution is negatively skewed.






UNIT III 



VARIABILITY



General Objectives

A measure of variability is a single number that expresses
the extent to which scores in a set of data spread out or
disperse around a measure of central tendency, usually the
mean. Low variability indicates that most of the scores in
a set of data bunch up close to the mean while large
variability indicates that most of the scores are spread out a
considerable distance from the mean. The present unit is
designed to familiarize the student with three variability
measures (the range, variance, and standard deviation) and
the computational procedures for each.

Specific Objectives

3.1 Define variability

3.2 Define and compute the range

3.3 Define and compute the variance

3.4 Define and compute the standard deviation

Instructional Unit: Variability

In Units I and II you learned (or attempted to learn)
how to characterize sets of scores in terms of a) the form
of the distribution, and b) central tendency. If someone
walked up to you in the street and asked questions about
your data, you could reply (for example): "I ended up
with a symmetrical, bell-shaped distribution in which the
mean was 75, the median 75, and the mode 75." Obviously,
the stranger would be highly impressed, and if he too had
completed the first two Units, he could immediately
formulate a pretty good "picture" of how your data looked.
If, however, he had completed Unit III, his reaction to
your reply might be: "O.K. buddy, but what about your
variability? I am lost until you tell me that." In a
sense, his reaction would be entirely proper: in order
to formulate an accurate picture of a set of data one
really needs to know about form, central tendency, and
variability.

Take, for example, the following two sets of data:



Set 1        Set 2

X    f       X     f
52   3       100   3
51   4       75    4
50   5       50    5
49   4       25    4
48   3       0     3



Each set consists of 19 scores (N = 19). As far as form
is concerned, each constitutes a bell-shaped, symmetrical
distribution. As far as central tendency is concerned,
the mean, mode, and median are equal to 50 in both sets.
But are both sets of data the same even though they do not
differ either in form or central tendency? The answer is
obviously "no." In Set 1, the scores are bunched together,
whereas in Set 2, the scores are spread apart. Thus, we
can differentiate between the two sets of data in terms of
their variability.

Definition: Variability refers to the extent to which
scores in a distribution bunch up close to, or spread
out far from, a measure of central tendency, usually the
mean. Variability is expressed by a single number and,
like central tendency, there are several measures of
variability. The most common are the range, variance, and
standard deviation. In any case though, a single number
expresses how compacted the total set of numbers is
around the mean. By compacted is meant "how spread out."

POSTTEST (answer in back)

Define variability

Instructional Unit: The Range

The range is the simplest measure of variability to
compute. However, it is also the most unreliable for
reasons to be discussed later. To compute the range, all
you have to do is obtain the distance from the lowest score
in the distribution to the highest score in the
distribution. More simply, subtract the lowest score from the
highest score; the result is the range (this is almost as
easy as computing modes).

Range = highest score minus lowest score



Set 1

X    f
52   3
51   4
50   5
49   4
48   3

Range = 52.5 (upper limit of top score)
       -47.5 (lower limit of bottom score)

         5.0

Set 2

X     f
100   3
75    4
50    5
25    4
0     3

Range = 100.5 - (-.5)
Range = 101

*note - lower limit of 0 is -.5



Another Example: Determine the range in the following
distribution:

X       f
50-55   3
44-49   2
38-43   3
32-37   8
26-31   1

Range = 55.5 (upper limit of top score)
       -25.5 (lower limit of bottom score)

        30.0



Final note: The range represents a total variability measure.
That is, the range gives you a number that represents a
distance where all frequencies above and below the mean have
been accounted for. A large number indicates that scores
spread out far around the mean while a small number indicates
that scores are bunched up close to the mean. Note the
differences in the value of the range between Data Sets 1 & 2
above.

While the range is the easiest measure of variability to
compute, it is also the least stable. Simply changing one
number can radically change the range as the following example
shows. Therefore, if one can pick the measure of variability,
the range typically should not be selected. It is easy to
compute, but can lead to considerable misinterpretation.



Example:

10                                    15
9                                     9
8                                     8
7     but if 10 is changed to 15:     7
6                                     6
5                                     5

Range = 10.5 - 4.5 = 6                Range = 15.5 - 4.5 = 11

                                      (or nearly double the
                                      previous range)
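
The range computations above all follow the same pattern: upper real limit of the top score minus lower real limit of the bottom score. A minimal Python sketch is given below. The function name and the `unit` parameter are assumptions made for this illustration; the default assumes scores are recorded in whole units, so each real limit lies half a unit beyond the score.

```python
def score_range(lowest, highest, unit=1):
    """Range from real limits: (highest + unit/2) - (lowest - unit/2)."""
    half = unit / 2
    return (highest + half) - (lowest - half)


print(score_range(48, 52))    # 5.0   (Set 1)
print(score_range(0, 100))    # 101.0 (Set 2)
print(score_range(26, 55))    # 30.0  (grouped example: 26-31 up to 50-55)

# One changed score nearly doubles the range
print(score_range(5, 10))     # 6.0
print(score_range(5, 15))     # 11.0
```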

POSTTEST (answers in back)

1. Define the range

2. For each of the following sets of data, compute the range



Set 1 • 


^et 2 


Set 




X 


• 

£ 




87 


106 


2 


' 20-24 . 


83 


105* 


3 


15-19 


81 


104 


4 


* • 10-14 


80 


100 


2 


^. 5-9 


68 


98 


1 


0-4 


68 




3 


• \ 


.45 




4 






Instructional Unit: The Variance

As indicated above (Unit 3.2), the range does not
provide a particularly stable measure of variability since it
is fully dependent on the values of only two scores, the
highest and the lowest. A more suitable measure, it would
seem, should take into account all scores in the
distribution, and their relative distances from the mean. Such
a measure is the variance.

The variance is a number that represents the average
of squared deviations around the mean. The larger the
variance is, the farther away scores tend to be (i.e.,
deviate further) from the mean. The smaller the variance,
the closer scores tend to be (i.e., deviate little) from the
mean. By subtracting the mean from each score in a
distribution, squaring this deviation, and then taking the
mean of these squared deviations, the variance is obtained.

Let's run through that again:

1. Subtract the mean from each score in the
distribution.

2. Square each deviation score.

3. Compute their average.



We can express this whole operation by the following formula:

σ² = Σx² / N

where: σ² = symbol for variance, called sigma squared
(σ is the lower case Greek letter "sigma").

x² = squared deviations from the mean (small x is used to
express a deviation score).

N = number of cases or frequencies (as it always has).
ExJrole Problem : Compute varlahce for following set of data. 



x2 



24 1 4 16 

22 i 2 4 

20 1 0 0« 

18 1 -2 , 4 

16 1 -4 16 

EX - 100 Sx^ - 40 

X - 20 



1^ -IfcO - 8 ■ -S^ , 

5 (which la alBq>ly the meuSq,f the coltnm of 
mmbers) • 



If you examine the above computational procedures, you
should readily understand what we did. We were given our
frequency distribution; (1) we computed the mean; (2) we
subtracted the mean from each score in the distribution
and called the result "x" (for convenience we added an "x"
column to the distribution); (3) we squared each x (or deviation
score) in the distribution; (4) we summed all the $x^2$'s
and took their average by dividing by N. The result, 8, is
the variance, defined as the average of the squared deviations.
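
The four steps just described can be sketched in Python as follows. This is only an illustration of the definition; the function name `population_variance` is an assumption made for this sketch, not a term from the text.

```python
def population_variance(scores):
    """Average of the squared deviations around the mean (divide by N)."""
    n = len(scores)
    mean = sum(scores) / n                         # step 1: the mean
    deviations = [x - mean for x in scores]        # step 2: the "x" column
    squared = [d ** 2 for d in deviations]         # step 3: the x^2 column
    return sum(squared) / n                        # step 4: average them


scores = [24, 22, 20, 18, 16]
print(population_variance(scores))  # 8.0
```

The result matches the hand computation in the example problem: the squared deviations sum to 40, and 40 divided by 5 is 8.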



It is important that you understand that the computation
of the variance, unlike that for the range, takes
into account every score in the distribution. Accordingly,
it is a much more sensitive and reliable measure of
variability.

In looking over the formula for the variance, you may
question why it involves squaring each deviation (x) score.
Perhaps, at first glance, it would seem a lot simpler and
more sensible to just add the deviation scores and take
their average. In other words, why bother to square?

The answer to the above question may be found in the
definition of the mean (Unit 2.4.1). By definition, the
mean is the point of balance in the distribution; the sum
of deviation scores of all points above the mean is always
equal in size (but opposite in sign) to the sum of the
deviation scores of all points below the mean. For example,
if you subtracted the mean from all scores above the mean
and added those deviation scores together, you might get a
sum of +92. Therefore, it follows that if you subtracted the
mean from all scores below the mean in the same distribution,
the sum of those scores would be -92.

If you are still disbelieving, go back to the example
problem above and add the unsquared deviation
scores (i.e., the x's). You will find that for the
two scores above the mean, $\sum x = +6$. For the two scores
below the mean, $\sum x = -6$. When you add these two partial
sums together the result is (+6) + (-6) = 0. This will always
be the case! Anytime, for any distribution you can imagine,
the sum of the unsquared deviation scores (x's) will always
be equal to zero. Obviously, then, $\sum x$ would not be helpful in
determining variability, or anything else for that matter.
By squaring the x's we eliminate positive and negative
signs, and thus will never get $\sum x^2 = 0$ (unless every single
score was equal to the mean). If none of this has made any
sense to you, add the x's (without squaring them) in all
example problems, and you will prove to yourself that a "zero"
result must always be the case.
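
For readers who would rather let a computer do the adding, here is a minimal Python sketch of the same check. The function name `deviation_sums` is hypothetical; it simply separates the deviations above and below the mean and also returns the sum of their squares.

```python
def deviation_sums(scores):
    """Return (sum of positive deviations, sum of negative deviations,
    sum of squared deviations) around the mean."""
    mean = sum(scores) / len(scores)
    deviations = [x - mean for x in scores]
    above = sum(d for d in deviations if d > 0)
    below = sum(d for d in deviations if d < 0)
    squared_total = sum(d ** 2 for d in deviations)
    return above, below, squared_total


above, below, squared_total = deviation_sums([24, 22, 20, 18, 16])
print(above, below, above + below)  # 6.0 -6.0 0.0
print(squared_total)                # 40.0
```

With means that are not whole numbers, floating-point rounding can leave the sum of the unsquared deviations a hair away from zero (on the order of 1e-15), so a comparison such as `math.isclose(total, 0, abs_tol=1e-9)` is safer than testing for exact equality.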

Take some time now to learn the formula for the variance:

$$\sigma^2 = \frac{\sum x^2}{N}$$

Remember, little "x" is a deviation score!

When the mean is a whole number, the variance should be
relatively easy to compute. But suppose you computed the mean
of your distribution and found it to be a number such as
20.6. You could still use the above formula, but to
determine $\sum x^2$, it could get a little messy subtracting 20.6
from each score and squaring the result. You would
obviously end up with x's with decimal points, and $x^2$'s with
decimals.

Fortunately, by simple algebraic manipulation, an
alternative formula has been worked out. This formula should
probably be used whenever the mean ($\bar{X}$) is not equal to a whole
number. The formula is:

$$\sigma^2 = \frac{\sum X^2 - \frac{(\sum X)^2}{N}}{N}$$

where $X^2$ = squared raw scores (not small x, or deviation,
scores). Actually, the numerator of the above formula is
equal to the numerator of the original formula, $\sum x^2$. In
both cases, you divide the numerator by N.

The alternative formula is called the raw score formula;
it yields the same result as the original, but eliminates
subtracting and squaring numbers with decimals.

Example Problem: Given the same data used in the preceding
example problem, find the variance by means of the raw score
formula. (Since the mean = 20, a whole number, we would, in
actuality, probably be content to stick with the original
formula; but this is only for practice.)

     X      X^2

    24      576
    22      484
    20      400
    18      324
    16      256

    $\sum X = 100$     $\sum X^2 = 2{,}040$

In order to use the raw score formula, we must find N, $\sum X^2$,
and $(\sum X)^2$.

    N = 5 (that is easy)
    $\sum X^2 = 2{,}040$ (squaring each raw score and then adding
    them together; see column 2)
    $(\sum X)^2 = 10{,}000$

Substituting:

$$\sigma^2 = \frac{\sum X^2 - \frac{(\sum X)^2}{N}}{N} = \frac{2{,}040 - \frac{10{,}000}{5}}{5} = \frac{2{,}040 - 2{,}000}{5} = \frac{40}{5} = 8$$

Note that this is the same answer we obtained when the original
formula was used.
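
The raw score formula translates almost word for word into code. The sketch below is illustrative only (the name `variance_raw_score` is an assumption); it never computes the mean or any deviation score, which is the whole appeal of the formula.

```python
def variance_raw_score(scores):
    """Population variance from raw scores:
    (sum of X^2 - (sum of X)^2 / N) / N."""
    n = len(scores)
    sum_x = sum(scores)                     # sum of X
    sum_x_sq = sum(x * x for x in scores)   # sum of X^2
    return (sum_x_sq - sum_x ** 2 / n) / n


print(variance_raw_score([24, 22, 20, 18, 16]))  # 8.0
```

Here the sum of X is 100 and the sum of X^2 is 2,040, so the function evaluates (2,040 - 2,000) / 5, the same 8 obtained with the original formula.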

Two more examples will be given, computing the variance using
the raw score formula.

Example Problem: Compute the variance using the raw score
formula (in this case the mean is not a whole number, so the
raw score formula would be considerably easier to use than
the original).

     X      f      fX      X^2     fX^2

    10      3      30      100      300
     9      2      18       81      162
     8      4      32       64      256
     7      2      14       49       98
     6      2      12       36       72

    N = 13    $\sum fX = 106$    $\sum fX^2 = 888$

$$\sigma^2 = \frac{\sum fX^2 - \frac{(\sum fX)^2}{N}}{N} = \frac{888 - \frac{(106)^2}{13}}{13} = \frac{888 - \frac{11{,}236}{13}}{13} = \frac{888 - 864.31}{13} = \frac{23.69}{13} = 1.82$$

Notes:

(1) Since there are 13 scores, not 5, all 13 must
    be added together and all must be squared;
    multiplying the X and the $X^2$ by f accomplishes
    this. Make sure you do not multiply (f)(X)
    and then square; $(fX)^2$ is not correct.

(2) 1.82, the variance, represents the average
    of a column of squared deviations from the
    mean.
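
As a check on the frequency-table arithmetic, the following Python sketch applies the raw score formula to (score, frequency) pairs. The function name and the list-of-pairs layout are assumptions chosen for illustration.

```python
def variance_from_frequencies(table):
    """Population variance for a frequency distribution.

    table is a list of (score, frequency) pairs. Note that each
    score is squared first and then multiplied by f (fX^2);
    squaring the product (fX)^2 would be wrong.
    """
    n = sum(f for _, f in table)
    sum_fx = sum(f * x for x, f in table)
    sum_fx_sq = sum(f * x ** 2 for x, f in table)
    return (sum_fx_sq - sum_fx ** 2 / n) / n


table = [(10, 3), (9, 2), (8, 4), (7, 2), (6, 2)]
print(round(variance_from_frequencies(table), 2))  # 1.82
```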

Example Problem: Compute the variance for the grouped data
shown below.

      X       f      X'     fX'     X'^2    fX'^2

    20-22     1      21      21      441      441
    17-19     3      18      54      324      972
    14-16     4      15      60      225      900
    11-13     6      12      72      144      864
     8-10     3       9      27       81      243
     5- 7     2       6      12       36       72
     2- 4     1       3       3        9        9

    N = 20    $\sum fX' = 249$    $\sum fX'^2 = 3{,}501$

$$\sigma^2 = \frac{\sum fX'^2 - \frac{(\sum fX')^2}{N}}{N} = \frac{3{,}501 - \frac{(249)^2}{20}}{20} = \frac{3{,}501 - \frac{62{,}001}{20}}{20} = \frac{3{,}501 - 3{,}100}{20} = \frac{401}{20} = 20.05$$

Notes: (1) Again, 20.05 is the average of a column of squared
           deviations around the mean.

       (2) Notice that the X scores in this case are the
           midpoints (X') of the intervals since, given an
           interval, the midpoint is our best estimate of
           the "typical" score in that interval.
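
A hedged Python sketch of the grouped-data procedure follows. It assumes each interval is given as (low, high, frequency) and uses the interval midpoint as X'; the function name `grouped_variance` is illustrative.

```python
def grouped_variance(intervals):
    """Population variance for grouped data, using interval midpoints.

    intervals is a list of (low, high, frequency) triples.
    """
    # Replace each interval by its midpoint X'.
    table = [((low + high) / 2, f) for low, high, f in intervals]
    n = sum(f for _, f in table)
    sum_fx = sum(f * mid for mid, f in table)
    sum_fx_sq = sum(f * mid ** 2 for mid, f in table)
    return (sum_fx_sq - sum_fx ** 2 / n) / n


intervals = [(20, 22, 1), (17, 19, 3), (14, 16, 4), (11, 13, 6),
             (8, 10, 3), (5, 7, 2), (2, 4, 1)]
print(round(grouped_variance(intervals), 4))  # 20.0475
```

The unrounded value is 20.0475; the worked example reports 20.05 because it rounds the intermediate quotient 62,001 / 20 to 3,100 before subtracting.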

Final Notes:

A. Do not try to memorize all the different varieties
   of the formulas. It will be sufficient to simply
   learn the raw score formula and realize that in
   grouped type data, the scores (X or X') have to be
   squared and multiplied by f. The important thing
   is that all frequencies be accounted for.

B. Remember, the variance is an average (actually a
   mean). It is the average of the squared deviation
   scores around the original mean.

C. The variance value is expressed in terms of the
   original raw scores, except that it is in squared units.
   For example, if the original data is in inches, then
   the variance is expressed in squared inches. This
   is because the deviation scores are squared in
   computing the variance.

D. As will be seen later, the variance formula that
   was given assumes that you have all the data on
   hand, i.e., the population. If you have sample
   data and are trying to estimate what the variance
   would be in the population, then the numerator
   should be divided by (n-1) instead of (N). That
   is the only change in the basic formula. Thus, the
   sample variance, symbolized as $s^2$, is used to
   estimate the population variance. Small "n" is
   used to indicate that the observations (people)
   are part of a sample.



Later on in this text, we will describe the differences
between samples and populations in considerable detail.
For now, so that you at least have some understanding of
what the "$s^2$/$\sigma^2$" distinction is about, here is a brief
explanation. When measurements are taken, whether in the
form of test scores, physical characteristics, or something
else, they can encompass the entire group of interest (the
population) or only part of that group (a sample). Populations
are easy to assess when the members are small in
number and easily accessible (such as a class of students,
members of a football team, etc.). Otherwise, their
assessment is extremely difficult, such as when the population
of interest consists of all students in an entire
school district, all college football players, or all living
medical staff who worked for a particular hospital in 1957.
In such instances, the statistician will usually be forced
to work with a sample of individuals selected from the
population. The sample is not interesting in itself, but
rather for what it implies about the population. More
specifically, it is used to estimate what the population is
like when all its members are included. Thus, the variance
statistic, $s^2$, is basically important as an estimate of
the population "parameter" (a new term), $\sigma^2$. It has been
found that, in the long run, using "n-1" in the denominator
of the variance formula makes a better (more accurate)
estimate of $\sigma^2$ than when "n" is used. And that is what the
game of statistics is all about! The game continues below...

The formula for $s^2$ is:

$$s^2 = \frac{\sum x^2}{n-1} \;\text{(original formula)} \qquad s^2 = \frac{\sum X^2 - \frac{(\sum X)^2}{n}}{n-1} \;\text{(raw score formula)}$$

Note that the only change from the formula we have been
using is that the denominator is n-1. If you are asked
to compute the variance and the problem states that we are
dealing with sample data, use n-1 in the denominator. Otherwise,
use N, as we have been doing in the preceding example
problems.

The reason that the denominator becomes n-1 is that sample
variance tends to be systematically smaller than the variance
of the population from which the sample has been drawn. This
makes sense because a sample is not likely to have the complete
range of scores contained in the population. The
denominator n-1 produces a somewhat larger variance and is
called an unbiased estimate of the population variance.
While the difference in the variances obtained by use of n
and n-1 is rather trivial in large sized samples, it can be
quite significant in smaller samples. Therefore, it is
especially important to use the $s^2$ formula when sample size
is small. To be safe, always use $s^2$ when you are dealing
with sample data.
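
To see how much the choice of denominator matters, here is a small Python sketch that computes both versions for the same five scores. The function names are assumptions made for illustration. Python's standard `statistics` module offers `pvariance` (divides by N) and `variance` (divides by n-1), which follow the same two definitions.

```python
def population_variance(scores):
    """Divide the sum of squared deviations by N."""
    n = len(scores)
    mean = sum(scores) / n
    return sum((x - mean) ** 2 for x in scores) / n


def sample_variance(scores):
    """Divide the sum of squared deviations by n - 1."""
    n = len(scores)
    mean = sum(scores) / n
    return sum((x - mean) ** 2 for x in scores) / (n - 1)


scores = [24, 22, 20, 18, 16]
print(population_variance(scores))  # 8.0   (40 / 5)
print(sample_variance(scores))      # 10.0  (40 / 4)
```

With only five scores the two answers differ by a quarter of the population value; with hundreds of scores the gap would shrink to a fraction of a percent.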



E. If a constant is added to, or subtracted from, a set
   of data, variability (the variance) will not be altered.
   Suppose, for example, 2 points were added to every score
   in a distribution. Would the mean be changed? Obviously
   it would; since every score is increased by 2,
   the mean would also become 2 points higher. But, would
   the spread of the scores (i.e., the variance) be altered?
   No, since every score would be similarly affected, they
   would remain just as close together as they were before
   the 2 points were added. Another way of understanding
   this is to think about the formula (or definition) for
   the variance: the average of the squared deviation
   scores. How do you get a deviation score? Answer:
   subtract the mean from each score. But if the mean is
   increased by 2, and each score is increased by 2, would
   the deviation scores change from what they were before
   the points were added? Answer: an emphatic "NO"!

   Example Problem: We find $\sigma^2 = 25$ in a certain distribution.
   We then add 1,000 points to each score.
   What would the new variance be?

   Answer: $\sigma^2 = 25$. No change.

F. If a set of data is multiplied or divided by a constant,
   the variance is affected by the square of that constant.
   Thus, for example, if all scores are doubled (multiplied
   by 2), the variance is $2^2$ or 4 times as large. If all
   scores are divided by two, the variance becomes 1/4 as
   large; if all scores are multiplied by 3, the variance
   becomes 9 times as large; if all scores are divided by
   10, the variance becomes 1/100 as large, etc., etc.

Let's examine how this occurs through the following example.

Example: A teacher gives a 50-item test to three students
and obtains the following results.

                  X       X^2

    Student A    47      2209
    Student B    42      1764
    Student C    30       900

                119      4873

She computes the variance as follows:

$$\sigma^2 = \frac{4873 - \frac{(119)^2}{3}}{3} = \frac{4873 - 4720.33}{3}$$

$$\sigma^2 = 50.89$$

To make the results comparable to the 100-item tests she
has previously given, she decides to multiply each student's
score by 2. Thus, she gets:

                  X       X^2

    Student A    94      8836
    Student B    84      7056
    Student C    60      3600

                238     19492

She computes the variance:

$$\sigma^2 = \frac{19492 - \frac{(238)^2}{3}}{3} = \frac{19492 - 18881.33}{3}$$

$$\sigma^2 = 203.56$$

Compare the original and new variances, and you will see
that the latter is 4 times larger, as the rule predicts.
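
Both rules (adding a constant, multiplying by a constant) can be verified with the teacher's three scores. The sketch below is a plain-Python illustration; the helper name `pvar` is an assumption.

```python
def pvar(scores):
    """Population variance: mean of squared deviations (divide by N)."""
    n = len(scores)
    mean = sum(scores) / n
    return sum((x - mean) ** 2 for x in scores) / n


scores = [47, 42, 30]
base = pvar(scores)
shifted = pvar([x + 1000 for x in scores])  # add a constant
doubled = pvar([x * 2 for x in scores])     # multiply by a constant

print(round(base, 2))     # 50.89
print(round(shifted, 2))  # 50.89  (unchanged)
print(round(doubled, 2))  # 203.56 (2 squared, or 4, times as large)
```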
For our next trick, we will make a posttest appear from
what was previously a completely blank page. Watch
below.



3.3 POSTTEST (answers in back)

1. Define the variance.

2. For each set of data below, compute the variance.

    Set 1        Set 2           Set 3
      X         X     f         X      f

     26        42     2       12-14    4
     24        40     3        9-11    2
     23        38     4        6- 8    1
     22        36     4        3- 5    2
     20        34     6        0- 2    3
     19        32     3
     17        30     1
     15

3. Suppose that in all three sets of data, each score was
   decreased by 10. What would the new variances be?

4. Suppose that in all three sets of data, each score was
   multiplied by 3. What would the new variances be?




3.4 Instructional Unit: The Standard Deviation

The third (and final) measure of variability you will
be asked to learn is the standard deviation. If you
were successful in learning how to compute the variance,
the following should come as very good news: The standard
deviation is simply the square root of the variance.

Thus, all you have to do is (1) compute the variance;
(2) take its square root; and (3) you end up with the
standard deviation. There is really not much new here
to learn.

Recall that the symbol for the variance was $\sigma^2$ (or $s^2$
if it is an estimate of the population variance), so
the standard deviation is symbolized as $\sigma$ (or s if you
are estimating the population standard deviation).

Therefore:

    basic formula                      raw score formula

$$\sigma = \sqrt{\frac{\sum x^2}{N}} \qquad\qquad \sigma = \sqrt{\frac{\sum X^2 - \frac{(\sum X)^2}{N}}{N}}$$

Example Problem: Suppose we find that for a given population,
the variance is equal to 64.
What is the standard deviation?

Answer: $\sigma = 8$ (simply the square root
of the variance)

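Example Problem: Compute the standard deviation for the
following data.

Note: little "x" is a deviation score!

     X      x     x^2     X^2

     5      2      4      25
     4      1      1      16
     3      0      0       9
     2     -1      1       4
     1     -2      4       1

    $\sum X = 15$    $\sum x^2 = 10$    $\sum X^2 = 55$
    $\bar{X} = 3$

Using the basic formula:

$$\sigma = \sqrt{\frac{\sum x^2}{N}} = \sqrt{\frac{10}{5}} = \sqrt{2} = 1.41$$

Using the raw score formula:

$$\sigma = \sqrt{\frac{55 - \frac{(15)^2}{5}}{5}} = \sqrt{\frac{55 - \frac{225}{5}}{5}} = \sqrt{\frac{55 - 45}{5}} = \sqrt{\frac{10}{5}} = \sqrt{2} = 1.41$$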
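
Since the standard deviation is just the square root of the variance, a code sketch needs only one extra line. The function name `population_sd` is an assumption for illustration; `math.sqrt` is the standard-library square root.

```python
import math


def population_sd(scores):
    """Square root of the population variance (raw score formula)."""
    n = len(scores)
    sum_x = sum(scores)
    sum_x_sq = sum(x * x for x in scores)
    variance = (sum_x_sq - sum_x ** 2 / n) / n
    return math.sqrt(variance)


print(round(population_sd([5, 4, 3, 2, 1]), 2))  # 1.41
print(math.sqrt(64))                             # 8.0
```

The first line reproduces the example above (a variance of 2 gives a standard deviation of about 1.41); the second is the earlier problem in which a variance of 64 gives a standard deviation of 8.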

Example Problem: Using the raw score formula, compute the
standard deviation for the following grouped data.

      X       f      X'     fX'     X'^2    fX'^2

    14-16     2      15      30      225      450
    11-13     3      12      36      144      432
     8-10     3       9      27       81      243
     5- 7     2       6      12       36       72
     2- 4     1       3       3        9        9

    N = 11    $\sum fX' = 108$    $\sum fX'^2 = 1{,}206$

$$\sigma = \sqrt{\frac{1{,}206 - \frac{(108)^2}{11}}{11}} = \sqrt{\frac{1{,}206 - \frac{11{,}664}{11}}{11}} = \sqrt{\frac{1{,}206 - 1{,}060.36}{11}} = \sqrt{\frac{145.64}{11}} = \sqrt{13.24} = 3.64$$

Like the variance, $\sigma$ is a measure of variability. The
larger the $\sigma$, the more spread out the scores are from the
mean. The smaller the $\sigma$, the more bunched up the numbers
are relative to the mean. Also, like the variance, the
standard deviation could be 0 if all numbers in a set of
data are the same (no variability). The standard deviation
can be a positive number, if there is some variability; however,
it can never be a negative number.

One useful property of the standard deviation is that,
assuming the set of data resembles the general pattern of
a bell-shaped curve (or normal distribution), certain
percentages of N (the total number of scores) lie within
certain distances or standard deviation units from the
mean.

For instance:

    a) 68% of the scores fall within ± 1 standard deviation
       from the mean. (The 68% is approximate.)

    b) 95% of N falls within ± 2 standard deviations from
       the mean.

    c) over 99% of N falls within ± 3 standard deviations
       from the mean.

Thus, for example, if we have a bell-shaped distribution
with a mean of 60 ($\bar{X} = 60$) and a standard deviation of 5
($\sigma = 5$), we can conclude that approximately 68% of the
scores fall between 65 (one $\sigma$ above the mean) and 55
(one $\sigma$ below the mean); and that approximately 95% of the
scores fall between 70 (2 $\sigma$ above the mean) and 50 (2 $\sigma$
below the mean). Don't worry too much about this now
(if you do not understand it); worry about it when you
get to Unit IV.

In any case, by knowing $\bar{X}$ and $\sigma$ we can construct an interval
reaching one $\sigma$ on either side of $\bar{X}$ such that 68%, or about
two-thirds, of the scores will fall within that interval. In
distributions that have a small standard deviation, the interval
around $\bar{X}$ is short since the scores bunch up close to $\bar{X}$.
In distributions where the standard deviation is large, the
interval around $\bar{X}$ is large since scores spread out farther
from the mean. In either case, the percentage of the total
N (68%) is the same. What is different is how wide the interval
around $\bar{X}$ must be to obtain the 68%: if $\sigma = 3$, the interval
is 6 points wide (3 on each side of $\bar{X}$); if $\sigma = 5$, it is 10
points wide, etc. The larger the standard deviation, the farther
out you have to go to include two-thirds or 68% of the scores.
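
The percentages quoted above can be reproduced for an idealized normal distribution with Python's standard library. This sketch assumes Python 3.8 or later, where `statistics.NormalDist` is available; the function name `share_within` is hypothetical. Real data that only roughly resemble a bell curve will give only roughly these figures.

```python
from statistics import NormalDist


def share_within(mean, sd, k):
    """Proportion of a normal distribution lying within
    k standard deviations of the mean."""
    dist = NormalDist(mu=mean, sigma=sd)
    return dist.cdf(mean + k * sd) - dist.cdf(mean - k * sd)


for k in (1, 2, 3):
    low, high = 60 - k * 5, 60 + k * 5
    pct = round(100 * share_within(60, 5, k), 1)
    print(f"{low} to {high}: {pct}%")
# 55 to 65: 68.3%
# 50 to 70: 95.4%
# 45 to 75: 99.7%
```

The same three percentages come out for any mean and standard deviation; only the endpoints of the intervals change.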



Final Notes:

A. The standard deviation is the square root of the
   variance.

B. The raw score formula is usually the best way to
   compute the standard deviation, especially if the
   mean is not equal to a whole number.

C. The standard deviation value is expressed in
   exactly the same terms as the original data. If
   the raw data is IQ scores, $\sigma$ is expressed in IQ
   units; if $\sigma = 15$ for IQ scores, the 15 means 15
   points. In contrast, you will recall, the variance
   would be expressed in squared IQ units. Thus,
   the value of the variance is not directly comparable
   to the original raw score units, but the value of
   the standard deviation can be directly compared.

D. If you are estimating the population standard deviation,
   the denominator should be n-1, rather than N.
   Therefore, s, or the estimate of the population $\sigma$, is:

$$s = \sqrt{\frac{\sum X^2 - \frac{(\sum X)^2}{n}}{n-1}}$$

E. Like the variance, adding or subtracting a constant
   from a set of data has no effect on variability,
   i.e., the standard deviation. However, if a set of
   data is multiplied or divided by a constant, the
   standard deviation is multiplied or divided by that
   same constant. For example, if the constant 4 is
   added to every score, the standard deviation remains
   unchanged; but if every number is multiplied by 4,
   the standard deviation will be 4 times its original
   value.

3.4 POSTTEST (answers in back)

1. Define the standard deviation.

2. For each set of data below, compute the standard deviation.

    Set 1        Set 2           Set 3
      X         X     f         X      f

     17        10     2       21-23    6
     14         9     3       18-20    7
     13         8     7       15-17    4
      9         7     6       12-14    3
      8         6     4
      3         5     2        6- 8    2

    (Sample data)

3. Suppose that in all three sets of data, each score was
   increased by 3. What would the new standard deviations
   be?




Unit III Review Test
(answers in back)

Consider the following frequency distribution:

     X      f

    20      1
    19      1
    18      5
    17      4
    16      4
    15      3
    14      2
    13      1
    12      2
    11      0
    10      1

Answer the following:

 1. $\sum fX$

 2. N

 3. $\bar{X}$

 4. $\sum fX^2$

 5. $(\sum fX)^2$

 6. $\sigma^2$

 7. $s^2$

 8. $\sigma$

 9. s

10. the range

11. $\sigma$ if 1 point is subtracted from every score

12. $\sigma$ if every score is divided by 2

13. $\sigma^2$ if every score is divided by 2

14. s if every score is multiplied by 6

15.    if every score is multiplied by 2

16. $s^2$ if 2 points are added to every score




UNIT IV

POSITION MEASURES AND THE NORMAL CURVE


General Objectives

The purpose of this section is to illustrate a basic
procedure used to describe the location of specific scores
within a distribution of data. Measures of central tendency
indicate the "typical" value; variability measures indicate
the degree of "spread" in the distribution; but position
measures locate specific points within the data set.


Specific Objectives

4.1 Define and compute z scores

4.2 Convert specified z scores into their appropriate raw
    scores

4.3 Convei^ raw scores and scores into ^ scores 

4.4 Describe and make a sketch of the normal curve

4.5 Show the raw score and z score scales on a normal curve
    diagram

4.6 On a normal curve, label the approximate percentage of
    scores between $\bar{X}$ and + or - 1 standard deviation, 2
    standard deviations, and 3 standard deviations

4.7 Using the "area under the normal curve" table, find
    the percentage of scores between any two specified
    z score values

4.8 Using the "area under the normal curve" table, find
    the percentage of scores between two specified raw
    scores

4.9 Using the "area under the normal curve" table, convert
    specified z scores to percentile ranks, and specified
    percentile ranks into z scores




4.1 Instructional Unit: z Scores

Percentile ranks, as discussed in Unit II, do provide a
measure of specific score positions, but cannot indicate
unambiguously how much better one person's score is relative
to another's. For example, if Bill scored 87 on a test,
the highest score in the class, his percentile rank would be
close to $P_{100}$. His friend Ernie scores 77, which is determined
to be $P_{80}$. Bill has thus scored 20 percentile points
higher than Ernie. But suppose an error was made in grading
Bill's paper, and his actual score is found to be 97, as
opposed to the original 87. Since 97 would obviously be the
highest score in the class, Bill's percentile rank would
still be $P_{100}$. In other words, even with 10 points added
to his original test score, Bill is still only 20 percentile
points higher than Ernie; changes in raw scores do not always
produce changes in percentile ranks.

In light of the limitations of percentile ranks, a different
position measure, z scores, will be introduced and
described in this unit. In addition, relationships among
percentile ranks, z scores, and the normal curve will be discussed.

A z score, like a percentile rank, is a position measure.
By position measure is meant a single number that represents
a specific location point within a distribution of scores.
It is not a distance measure like the standard deviation, or
a "representative" number like the mean, mode, and median.
Rather, it is a point measure. A percentile rank, for instance,
is a specific position measure that states what percentage of
scores fall below a certain point in the distribution.
Another position measure, called a z score, indicates how
many standard deviations away from the mean a certain score
is. In either case though (percentile ranks or z scores), the
number given indicates some point in the distribution.
Given this "point," different scores in the distribution can
be compared in terms of their location. This particular
function of z scores should become more understandable as
you work through this unit.

Definition: A z score is a number that indicates how
many standard deviations a raw score is away from the mean.

The whole ballgame is in the above sentence. Read it again.
So, if a person told you that his/her z score was equal to
1, would you know anything interesting? Yes! You would
know that he/she scored one standard deviation away from the
mean.



Positive z scores indicate that raw scores are a certain
distance above the mean, whereas negative z scores indicate
that raw scores are a certain number of standard deviations
below the mean. A z score of 0 (zero) indicates that the
raw score is exactly equal to the mean; in other words, no
standard deviations, or part of a standard deviation, above
or below the mean.

Read the above paragraph again. Now answer the following
question: If a student tells you his/her z score is equal
to +1, what does this mean?

Answer: That student scored one standard deviation above the
mean.

Question: Suppose the standard deviation, $\sigma$, was computed to
be 5; what does this mean?

Answer: The student scored 5 points (i.e., one standard
deviation) above the mean.

Question: Suppose the mean on the test was 75; what would
that indicate?

Answer: The student received a score of 80 points on the
test; 80 - 75 (the mean) = 5. The standard deviation, you will
recall, was 5 points. Therefore, z was +1.

Question: Suppose another student receives a z score = 0.
What does that suggest given the above test results?

Answer: The student received a score of 75 on the test,
because whenever z = 0, the raw score is exactly equal to
the mean.

Hopefully, now you are ready for the z score formula. It is
perfectly consistent with the logic used in deriving z scores
and raw scores in the above example. If you didn't understand
the example, go back to it once you feel comfortable with the
z score formula.

$$z = \frac{X - \bar{X}}{\sigma}$$

In plain English, the formula says: 1) Subtract the mean
($\bar{X}$) from the raw score (X) for which z is to be computed.
2) Divide the result by the standard deviation ($\sigma$) of the
distribution. 3) The final product is the z score for that
particular raw score (X).

Example Problem: I give a test and find the mean ($\bar{X}$) to be 75
and the standard deviation to be 6. I am interested in computing
a z score for a raw score of 69 received by one of
the students taking the test. What is z for the raw score,
69? (Or, phrasing the same question in a slightly different
manner:) How many standard deviations away from the mean is
the raw score, 69?


$$z = \frac{X - \bar{X}}{\sigma} = \frac{69 - 75}{6} = \frac{-6}{6}$$

z = -1.00, indicating that the student scored one standard
deviation below the mean.

Example Problem: In the same distribution as above, what would
be the z score for a raw score of 75?

$$z = \frac{75 - 75}{6} = \frac{0}{6} = 0.00$$

The student scored no deviations above or below the mean.

Example Problem: In the same distribution, what would be the
z score for a raw score of 87?

$$z = \frac{87 - 75}{6} = \frac{12}{6}$$

z = +2.00

Example Problem: Assume $\bar{X} = 19.8$ and $\sigma = 3.4$. Find the z scores
for raw scores of 22.4 and 14.9.

For 22.4:

$$z = \frac{22.4 - 19.8}{3.4} = \frac{2.6}{3.4}$$

z = +0.76

(In other words, the raw score, 22.4, is .76 standard
deviations above the mean.)

For 14.9:

$$z = \frac{14.9 - 19.8}{3.4} = \frac{-4.9}{3.4}$$

z = -1.44

(In other words, the raw score, 14.9, is 1.44 standard
deviations below the mean.)
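
The z score formula is short enough that a code version is almost a restatement. The sketch below is illustrative only (the function name `z_score` is an assumption); it reruns the example problems above.

```python
def z_score(raw, mean, sd):
    """Number of standard deviations a raw score lies from the mean.
    Positive: above the mean. Negative: below the mean."""
    return (raw - mean) / sd


print(z_score(69, 75, 6))                   # -1.0
print(z_score(75, 75, 6))                   # 0.0
print(z_score(87, 75, 6))                   # 2.0
print(round(z_score(22.4, 19.8, 3.4), 2))   # 0.76
print(round(z_score(14.9, 19.8, 3.4), 2))   # -1.44
```

Note that the sign comes out automatically from the subtraction, so there is no separate step for deciding whether the score is above or below the mean.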



Final Notes:

1. Make sure that you have the correct sign for the z
   score. A negative sign indicates that the score is
   below the mean, whereas a positive sign indicates
   that the score is above the mean.

2. z scores are comparable from one distribution to another,
   regardless of the means and standard deviations. All a z
   score of 0.50 says is that, regardless of the size of
   $\sigma$, a certain score is .5 $\sigma$ (half a standard deviation)
   above the mean. If it was determined that your score on
   a basic algebra test, for example, was equal to a z of
   0.56, and that your score on a basketball free-throw
   shooting test was equal to a z of 0.56, we could say
   that you performed equally well on both. In each case
   you scored .56 $\sigma$ above the mean.

3. A z score is a precise position measure in that it locates
   a raw score within a distribution by indicating
   how many standard deviations away from the mean that
   score is.



POSTTEST (answers in back)

1. Define z score.

2. For each of the following, compute the appropriate z
   scores for the specified raw scores.

   A. $\bar{X} = .8$, $\sigma = .03$:   X = .91, .87, .80, .72

   B. $\bar{X} = 92$, $\sigma = 7$:     X = 100, 99, 90, 82, 78

   C. $\bar{X} = 0$, $\sigma = 1$:      X = 1.5, -2, 0, -.4



4.2 Instructional Unit: Converting z scores into raw
score equivalents

This unit should not present any great difficulty, assuming
that you understand the material in the preceding
section. Given the mean and standard deviation of a distribution,
you now should be able to convert various raw
scores into their corresponding z scores. The present
section simply involves the reverse process: given the
mean and standard deviation of a distribution, you will be
asked to convert z scores into their corresponding raw
scores.

For example, I tell you that on a geography test $\bar{X} = 70$ and
$\sigma = 5$. John, a bright student, received z = 3.00 on that
particular test. What was the actual raw score that he
received?

To answer this question, we use the same logic that we
used in the preceding section, but in reverse. A z = 3.00
means that John scored 3 standard deviations above the mean.
Looking at the results of the test we know that $\sigma = 5$, and
therefore that $3\sigma = 15$ points. John therefore scored 15
points above the mean. Since the mean was reported as 70,
we conclude that John received a raw score of 85 on the
geography test. Hooray for John! If you are afraid you
will run into trouble by merely using logic (and you will
on more difficult problems), the basic z score formula is
shown (once again) below. It is the same formula that you
used in converting raw scores into z scores.

$$z = \frac{X - \bar{X}}{\sigma}$$

But, since now we are looking for "X" rather than "z," a
simple rearrangement of the formula should prove desirable:

    $z\sigma = X - \bar{X}$     (cross multiplying)

    $X - \bar{X} = z\sigma$     (switching sides of the equation)

    $X = z\sigma + \bar{X}$     (adding $\bar{X}$ to both sides of the equation)

We still have the same equation, but it is now in a slightly
different form. In effect, we have isolated X, thus making
it easier to compute its value. Using the above data regarding
John and the geography test, we can compute his raw
score as follows:

    $X = z\sigma + \bar{X} = 3.00(5) + 70 = (15 + 70) = 85$

The equation corresponds perfectly with the logic used in the
introductory example. Read the example again, but this time
follow the equation. Hopefully, it all makes sense, since
nothing magical has taken place: the z score formula has
simply been used in reverse.

Example Problem: Assume $\bar{X} = 60$, $\sigma = 3$. What is the raw score
for z = 0.70?

    $X = z\sigma + \bar{X}$

      = (.7)(3) + 60

      = 2.1 + 60

      = 62.1

Example Problem: Assume $\bar{X} = .3$, $\sigma = .06$. What are the raw
scores for z scores of 2.3 and -.9?

For z = 2.3:

    X = (2.3)(.06) + .3

      = .138 + .3

      = .438
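
The rearranged formula can be sketched in Python alongside the original one, which also makes it easy to confirm that the two undo each other. Both function names are assumptions made for illustration.

```python
def z_score(raw, mean, sd):
    """z = (X - mean) / sd"""
    return (raw - mean) / sd


def raw_score(z, mean, sd):
    """X = z * sd + mean (the z score formula solved for X)."""
    return z * sd + mean


print(raw_score(3.00, 70, 5))               # 85.0  (John's geography score)
print(round(raw_score(0.70, 60, 3), 3))     # 62.1
print(round(raw_score(2.3, 0.3, 0.06), 3))  # 0.438

# Converting John's raw score back recovers his z score.
print(z_score(raw_score(3.00, 70, 5), 70, 5))  # 3.0
```

The rounding in the second and third lines only tidies up floating-point noise; the underlying arithmetic is the same as in the worked examples.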




Some Properties of z Scores

Before turning to the next instructional unit, some
properties of z that were not listed in the "specific
objectives" section (first page of this unit) will be briefly
reviewed. Here is the first: Suppose that you take a final
exam in your two hardest courses, nuclear physics and advanced
basketweaving. You get the results back to find that your
score in physics was 75/100, whereas your score in
basketweaving was 86/100. When you show these results to your
parents (or friends) they conclude that your major strength
is in basketweaving and encourage you to turn professional.
Is their conclusion correct?

The answer is maybe yes and maybe no. Hopefully, you are
now aware that the interpretation must depend on additional
information, such as your relative standing to others in the
two classes. Your physics score, even though it was a measly
75, could have been the highest score in the class. Your
basketweaving score, even though it was an admirable looking
86, could have placed you in the lower echelon of all weavers
who ever wove. A clearer basis for comparison would be
provided if you computed your z score on each test. Let's say
you do (all you would need are the mean and the standard
deviation for each), and find that for physics, z = 2.32; and
for basketweaving, z = 0.58. Conclusion: you were better
than average on both tests, but on that particular day, and
relative to your classmates in each course, you were a better
physics student than you were a basketweaving student. Hopes
for a professional basketweaving career take a serious plunge.






The point we are trying to convey is that z scores allow
you to make comparisons between performances on different
tests. Regardless of raw scores (whether they be in the 80's,
90's, 100's, or 5,000's), the z scores position you relative
to others in your group. If z = 1.46 in biology, and z = 1.46
in English, we can infer that, relative to the other students
in those classes, your two performances were roughly equivalent.
In each case, you scored 1.46σ above the class mean. The next
problem is optional, but you will probably want to give it a
try. It could boost your ego tremendously if you get the right
answer.



Optional Super-Bonus Question 

A student gets a score of 85 on a test where X̄ = 100 and
σ = 15. What would be a comparable score on a test where X̄ = 40
and σ = 6? (Cover the solution steps below, as you should
attempt to solve this on your own.)

Solution Steps

The logic used to solve the problem is very straightforward:

(1) For the performance on the two tests to be equivalent,
    z (Test A) must be equal to z (Test B). In other words,
    both test scores should be the same relative distance
    from their respective means.

(2) The only possible way to proceed is to find the z score
    for Test A:

    z = (X - X̄) / σ = (85 - 100) / 15 = -15 / 15 = -1.00

(3) Now we know that an equivalent score on Test B would have
    a z score of -1.00. It would be one standard deviation
    below its mean. We proceed:

    X (Test B) = zσ + X̄

    X = -1.00(6) + 40     (remember, these values are for
                           Test B)
    X = -6 + 40
    X = 34

Thus, equivalent scores on the two tests would be an 85 for
Test A and a 34 for Test B. (Both convert to z = -1.00.)
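The two-step logic above (raw score to z on Test A, then z back to a raw score on Test B) can be sketched in a few lines. The function names are hypothetical helpers of our own, chosen only for illustration.

```python
def z_score(x, mean, sd):
    """Convert a raw score to a z score."""
    return (x - mean) / sd


def raw_score(z, mean, sd):
    """Convert a z score back to a raw score."""
    return z * sd + mean


def equivalent_score(x, mean_a, sd_a, mean_b, sd_b):
    """Find the Test B score with the same z as score x on Test A."""
    z = z_score(x, mean_a, sd_a)       # step (2): z on Test A
    return raw_score(z, mean_b, sd_b)  # step (3): same z on Test B


# The Super-Bonus Question: 85 on Test A (mean 100, sd 15),
# converted to Test B (mean 40, sd 6).
print(equivalent_score(85, 100, 15, 40, 6))
```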






Another z Score Property

Any time you have a group of scores, it is usually of
interest to find its central tendency (X̄) and its variability
(σ).

Almost always, that is...

When raw scores have been converted to z scores, central
tendency and variability are always the same from one
distribution to the next. Here's what happens. Suppose you
give a test to a class of 30 students. You compute the raw
scores for all students, and then the raw score mean and
standard deviation. Then, using the z score formula, you
convert each of the raw scores to a z. Looking over the z
scores, you begin to question what their mean (average) would
be, so you add up the 30 z scores and divide by 30. To your
amazement, you get a mean, z̄, equal to 0.00 (zero). This
arouses your curiosity, so you then enter the 30 z scores into
the standard deviation formula to determine their variability.
When you are done, you find the standard deviation of the z's
to equal 1.00. You are disbelieving, as you have learned from
all your math courses that problems never work out to give
such nice whole numbers like 0 and 1. A more reasonable answer
would be something like 1.05734. That you could trust. To
investigate this further, you wander around the school
building, collecting test results from all other teachers. For
each test, you convert the raw scores to z scores, and then
compute the z score mean and standard deviation. In each case
you get z̄ = 0 and σ = 1.00.

The lesson to be learned is as follows: when all raw
scores in a distribution are converted to z scores, the mean
of the z's will always be 0, the standard deviation will
be 1.00, and the form will be unchanged.
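If you remain disbelieving, you can run the experiment yourself without wandering the school building. The sketch below uses a handful of made-up raw scores and assumes the standard deviation formula that divides by N (Python's `statistics.pstdev`); if your course divides by N - 1 instead, use `statistics.stdev` in both places and the same result holds.

```python
from statistics import mean, pstdev


def to_z_scores(raw_scores):
    """Convert every raw score in a distribution to a z score."""
    m = mean(raw_scores)
    sd = pstdev(raw_scores)  # standard deviation dividing by N (assumed)
    return [(x - m) / sd for x in raw_scores]


raw = [62, 70, 75, 75, 81, 88, 93]  # hypothetical test scores
z = to_z_scores(raw)

# Apart from tiny floating-point error, these are 0 and 1.
print(mean(z), pstdev(z))
```

Try replacing the made-up scores with any others you like; the mean of the z's stays at 0 and their standard deviation stays at 1.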

For practice, try the following problem: The distribution
shown below indicates students' scores on a chemistry test.

          X       f

         95       3
         90      10
         85      16
         80      30
         75      17
         70      15
         65       4

If these scores are all converted to z's,
what would (a) the mean of the z's be?
           (b) the standard deviation of the z's be?

(If this problem takes you more than 5
seconds to answer, you do not understand
the concept.)

4.2 POSTTEST (Answers in back)

Convert the following z scores into their appropriate raw
scores:

1. X̄ = 100, σ = 15: z scores of 2.20, 1.00, -2.00, 0.00

2. X̄ = 1.2, σ = .3: z scores of 2.00, 0.30, -0.40, -1.30

3. X̄ = 50, σ = 4: z scores of 3.00, 2.00, -2.00, -2.80

4.3 Instructional Unit: Converting z scores into T scores

There is no question that z scores have practical
utility. At all large universities (and most colleges)
computer programs are available for use in scoring
examinations and in analyzing data from research experiments.
If you have a class of 30 or 40 students (which is common
at the college level) it is usually much more efficient
in terms of time and accuracy to use machine scoring on
objective tests as opposed to scoring the tests by hand.
You also obtain a lot more information from computer
scoring such as means, an item analysis, and yes, z
scores. Some instructors ignore (or de-emphasize) raw
scores, and use the z scores for determining grades
(others, of course, totally ignore the z scores, and some
have no idea what the z scores represent). To give you a
general idea of how z scores might be used, the instructor
knows that a student who received a z score of 0.00 was
exactly average (scoring right on the mean), and thus
might award him or her a C; a student who received a z
score of +1.00 (pretty good) would probably be awarded an
A, etc. There is a little more to it than these examples
would indicate, but the point is that it is sometimes
easier for the prof to award a grade fairly if he knows,
for example, that John Smith received z = +2.00 than if he
only knows that Smith answered 18 questions out of 20
correctly. z scores can be helpful, but whether they are to
be used or ignored is up to the whims and biases of the
individual instructor. Ask your statistics instructor to
show you (if you are interested) a copy of a print-out
from a computer-scored examination. You will discover
that the print-out contains most of the statistical indices
with which you have become familiar in the preceding units.
All of these indices are intended to help the instructor
understand how students performed, and suggest ways in which
the examination might be improved for subsequent usage.

All right, you say, but what is the objective of the
present instructional unit? To introduce the objective, it
might be helpful (or relevant, at least) to point out that at
many universities, computer print-outs of exam performance
use what are called T scores, as opposed to z scores, to
indicate students' relative standings. The reason for this
relates to several disadvantages of the z statistic: One
disadvantage is that in any distribution, approximately half
the z scores will be negative (indicating raw scores some
distance below the mean). As you probably realize, negative
numbers make calculations more difficult, and grade-book
summaries less interpretable than would be the case if only
positive numbers were employed. Another disadvantage of
the z statistic is that nearly all of the scores will involve
decimals; for example, z = 1.50 (for a raw score 1-1/2 σ
above the mean). Rarely will the scores end up whole
numbers, such as +2.00, +1.00, etc. The presence of decimals
obviously makes calculations more difficult and confusing.

Recognizing these disadvantages of z scores, it would
seem desirable to employ an equivalent measure which
eliminates the burden of working with negative numbers and
decimals. Such a measure is called a T score; it tells you the
same thing that a z score does, but is expressed in terms of
positive, whole numbers. T scores are so easy to work with
that in some university courses grades are determined
exclusively in terms of T score averages or totals. You might
have heard the term "grading on a curve." T scores offer a
way for this to be done.

If you know how to calculate z scores, the computations
of T scores should present little problem. You will recall
from the preceding unit that z̄ = 0.00, and σ_z = 1. For a T
score distribution, however, T̄ = 50 (that is, the average
T score is always equal to 50) and σ_T = 10 (the standard
deviation of the T's is equal to 10). We derive our T scores
directly from z scores using the following formula:

     T = 10z + 50

Thus, if we know a student's z score, finding his T score is
a relatively simple endeavor. By changing z to T we eliminate
negative numbers and decimals.



Example Problems:

1. John's z score on a test is +1.00. What is his T
   score?

   Answer: T = 10(1.00) + 50

           T = 60



104 






2. Floyd's z score is -2.06. Convert to a T score.

   Answer: T = 10(-2.06) + 50

             = (-20.6) + 50

             = 29.4 (rounding off)

             = 29

3. Reginald's raw score on a Health 4062 exam is exactly
   equal to the class mean. What is his T score?

   Answer: If he scored on the mean, z = 0.00 (Right?)

           T = 10(0.00) + 50

             = 50

   (a reasonable answer since T̄ = 50 - see page 104)

4. Edwinna scored 3σ below the mean on her sewing mid-term.
   What is her T score?

   Answer: If she scored 3σ below the mean we know that
           z = -3.00.

           T = 10(-3.00) + 50

           T = 20 (poor Edwinna)

   Also, remember that T̄ = 50 and σ_T = 10.
   Knowing that Edwinna scored 3σ below the mean
   translates to 30 points below the T mean, which
   is always 50. Thus, again we find that for
   poor Edwinna, T = 20. If this logic is confusing,
   stick to the above formula using z's.

5. On a bone-chewing speed test for dogs, X̄ = 40 seconds
   and σ = 5 seconds. Rover chews his bone in 42.5 seconds.
   What is Rover's T score?

   Answer: First compute z,

           z = (42.5 - 40) / 5 = 2.5 / 5 = +.50

   Finding T should be easy.

           T = 10(.50) + 50
             = 55
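The five example problems can be reproduced with a short sketch of the T score formula. The function names are our own, and rounding to a whole number follows the "rounding off" step in Floyd's problem. One caution, labeled plainly as an implementation detail: Python's built-in `round` sends exact halves to the nearest even number, which may differ from the rounding rule used in your class.

```python
def t_score(z):
    """Convert a z score to a whole-number T score: T = 10z + 50."""
    return round(10 * z + 50)


def t_from_raw(x, mean, sd):
    """Convert a raw score to a T score by way of its z score."""
    z = (x - mean) / sd
    return t_score(z)


print(t_score(1.00))            # John
print(t_score(-2.06))           # Floyd
print(t_score(0.00))            # Reginald
print(t_score(-3.00))           # Edwinna
print(t_from_raw(42.5, 40, 5))  # Rover
```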




To show the relationship between raw scores, z scores,
and T scores, some selected performances from a hypothetical
examination are shown below. The mean on that examination
was 75 and the standard deviation was 5. Keep these figures
in mind when viewing the resultant z and T scores.



Student Name     Raw Score     z Score     T Score

Bernie               90         +3.00         80
Petunia              87         +2.40         74
Louise               85         +2.00         70
Arthur               80         +1.00         60
Archie               78         +0.60         56
Flo                  75          0.00         50
Boris                74         -0.20         48
Beatrice             70         -1.00         40
Xavier               68         -1.40         36
Sally                65         -2.00         30
King                 60         -3.00         20



You will note that each T is 10 times its corresponding
z, plus 50. Answer this: if every score in the class was
included above, what would the average z score be?

Answer: 0.00

If every score was included, what would the average T be?

Answer: 50

We had mentioned earlier that T scores can come into play
for grading on a curve. If the instructor wanted to grade on
the curve, he might establish the following cutoffs for
letter grades.

     T Score          Grade

     59 or above        A
     54-58              B
     47-53              C
     42-46              D
     41 or below        F

Note that since T scores, like z scores, indicate distances
from the class mean, this procedure ensures that some students
will get grades below "C" and some will get grades above "C."
Assuming that the distribution approximates the "bell-shaped"
polygon that you learned to identify in Unit I (and will
study in more detail in the immediately following section),
there will be a greater proportion of "C's" in the above
class than there will be of any of the other grades. As one
moves upward or downward relative to "C" the number of
frequencies will decrease (there will be more "B's" than "A's"
and more "D's" than "F's"). The number of "B's" will be
approximately the same as the number of "D's"; the number of
"A's" will be roughly equal to the number of "F's". We are not
necessarily recommending this procedure of grading, but rather
are just presenting it as a system that some teachers use. In
fact, we should make you aware of its major disadvantage,
namely, that not on all tests do certain numbers of students
deserve to receive a particular grade, such as "A" or "B" or
"F". Here is a brief example that should illustrate this
problem. On the written test for Volleyball II, all 10
students receive a score of 90 percent or above. The result is
a whopping class mean of X̄ = 95. When a T score is computed
for John, who received a score of 91 (4 points below the mean),
it turns out to be T = 45. Using something similar to the
above grading scheme, the result will be a test grade of
"D" for John. But he knew 91 percent of the information
tested! Perhaps you can see where such a system would be
unfair. If a criterion-referenced system was used, which
specified particular grades for various test scores
(regardless of how the class does as a whole), John would
probably receive a "B" or an "A". As far as John and many
others would be concerned, one of the latter two grades would
be a great deal fairer than the "D" established by the curve.
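To see the curve at work (and to watch what happens to poor John), the cutoff table above can be written as a small function. This is only a sketch of the one hypothetical grading scheme shown in this unit; the function name is our own, and real instructors choose their own cutoffs.

```python
def letter_grade(t):
    """Map a whole-number T score to a letter grade using the
    cutoffs from the table above (59+, 54-58, 47-53, 42-46, 41-)."""
    if t >= 59:
        return "A"
    if t >= 54:
        return "B"
    if t >= 47:
        return "C"
    if t >= 42:
        return "D"
    return "F"


# John from Volleyball II, whose T score was reported as 45:
print(letter_grade(45))

# A few students from the earlier raw score / z / T table:
for name, t in [("Bernie", 80), ("Archie", 56), ("Flo", 50), ("Beatrice", 40)]:
    print(name, letter_grade(t))
```

Notice that nothing in the function knows John answered 91 percent of the questions correctly. The grade depends only on his distance from the class mean, which is exactly the disadvantage described above.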

In summary, the T and the z are standard scores. They
place score values relative to the mean of a distribution,
and thus provide standard bases for comparisons between
different tests. There are other common standard scores besides
T's and z's. IQ, for example, is derived through a standard
score formula. You probably know that values of 120, 135,
etc. are interpreted as representing fairly high IQ's. How
are those values obtained (they do not imply having to
answer 120 or 135 questions correctly on a test)? The formula
is this:

     IQ score = 15z + 100

Thus, regardless of how many questions are on an IQ test, an
appropriate IQ value can be derived by converting the
student's test score to a z score and then applying the above
formula. (Note: Some IQ formulas use slightly different
standard deviation values.)

Example Problem: Murray Binet, an unemployed psychologist who
believes that he is a descendant of Alfred Binet, devises what
he believes to be a new test of IQ. The test consists of
reading five newspaper "advice" columns and answering four
multiple-choice questions on each. He assumes that the more
questions that one misses (20 misses are possible), the higher
his/her IQ. The test is normed using 500 people sampled in
various laundromats across the country. The overall test
results yield X̄ = 12 (missed) and σ = 3. Jeanette H^fnagor,
a housewife from Milwaukee, misses only 2 out of the 20 items
and thus gets a score of 2 (since this test works on the basis
of number incorrect). Can you figure out what her IQ would be
represented in the standard IQ units?

Solution Steps

(1) First find z = (X - X̄) / σ = (2 - 12) / 3 = -10 / 3 = -3.33

(2) Then convert to an IQ score:

    IQ = 15z + 100 = 15(-3.33) + 100
                   = -50 + 100

    IQ = 50

Thus, Jeanette, as a result of answering 18 out of 20 questions
correctly, is determined to have an absolute score of 2 misses
and an IQ = 50.

Is this really her IQ? No, unless testing specialists and the
measurement community in general regard the test as a valid
measure of IQ, which seems highly unlikely. The moral of the
story is that any test scores can be converted to the IQ
standard by the above procedure. What makes the scores
representative of intelligence is agreement by measurement and
IQ experts that the test does, in fact, characterize people in
terms of that construct.

Here is a final example of standardization. You may have
heard people talk about their scores on the Scholastic Aptitude
Test (SAT), used in many cases for admission to undergraduate
programs, or their scores on the Graduate Record Examination
(GRE), used for admission to graduate programs.

The standardization procedure employed in these tests is:

     GRE (or SAT) score = 100z + 500

Some quick calculations would show you that an average score
(z = 0) would convert to a GRE standard of 500. If you scored
one standard deviation above the mean (z = +1.00), that would
convert to a standard score of 600. Three standard deviations
above (z = +3.00) would yield an 800; three below (z = -3.00)
would yield a 200, etc.
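You may have noticed that the T, IQ, and GRE formulas all have the same shape: multiply z by the new standard deviation, then add the new mean. A minimal sketch of that shared pattern follows. The function names are hypothetical, and the IQ standard deviation of 15 is the value used in this unit (recall that some IQ formulas use slightly different values).

```python
def standard_score(z, new_mean, new_sd):
    """General standard score: new_sd * z + new_mean."""
    return new_sd * z + new_mean


def t_score(z):
    return standard_score(z, 50, 10)    # T = 10z + 50


def iq_score(z):
    return standard_score(z, 100, 15)   # IQ = 15z + 100


def gre_score(z):
    return standard_score(z, 500, 100)  # GRE (or SAT) = 100z + 500


# Jeanette's laundromat-normed "IQ": raw score 2, mean 12, sd 3
z = (2 - 12) / 3
print(iq_score(z))

# GRE values for z = 0, +1, +3, and -3
for z in (0, 1, 3, -3):
    print(z, gre_score(z))
```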

It's now time to present the POSTTEST on T scores, as was
promised many sentences ago.






POSTTEST (answers in back)

1. Convert the following z scores into T scores.

   a. +2.00   b. 0.00   c. -1.56   d. +0.72   e. -2.40

2. I give an examination and find that X̄ = 22 and σ = 8.
   Determine z scores for the following raw scores.

   a. 14   b. 22   c. 26   d. 16   e. 34

3. I give an exam and find that X̄ = 68 and σ = 7. Find
   the raw scores for the following T scores.

   a. 50   b. 40   c. 35   d. 62   e. 47



Instructional Unit: Normal Curve

The normal curve, bell-shaped curve, or - technically -
the Gaussian curve, is simply a smoothed out frequency
polygon where the bunching of the frequencies occurs at
the middle and the stringing out or lower, flatter portions
of the curve occur at the ends.

The importance of the normal curve is not that it is the
ideal curve, or that it has some magical properties, but
rather that many human characteristics tend to produce
frequency polygons that take on the normal or Gaussian form.
Since many things in nature fit this pattern and since there
are certain known mathematical properties of the normal
curve (as with other curves also), scientists and laymen
alike use it for various types of quantitative work.




For this particular unit, the instructional objective
is that you gain a general understanding of the properties
of the normal distribution. You will not have to derive its
properties mathematically. But, can you describe in 10
words or more what the normal curve suggests? Suppose you
were told that a distribution of scores falls into the
normal curve pattern - what does that mean?

Examine the normal curve diagram and note that the figure
represents a frequency polygon of the type you studied (and
constructed) in Unit I. Perhaps you recall that frequency
polygons may differ in terms of their shape. Some of the
patterns we studied were the rectangular, bell-shaped
(actually another name for the normal curve), triangular,
negatively skewed, positively skewed, etc. So, when we
discuss the normal distribution, we are referring to a
frequency polygon that is unimodal, symmetrical, and
bell-shaped, and more!!

So, if a series of scores fall into a normal distribution,
what does that mean? The interpretation is fairly
straightforward. Looking at the X and f axis, we can say
the following: the highest frequencies (i.e., the most
"popular" scores) fall towards the middle of the distribution;
the mean, mode, and median are essentially identical;
and as you move farther and farther from the mean (or center
of the distribution), the frequencies of scores tend to
decrease.

Height, for example, tends to be normally distributed.
If you measure the height of every male in the United States
and plot a frequency polygon, the resultant curve will
resemble the bell-shaped (normal). The highest point on the
curve will correspond with the mean height in the population,
let's say, approximately 5' 9". As you move farther and
farther away from the mean in a positive direction, the
frequencies will decrease: more men will be 5' 10" than 6' 9";
fewer will be 6' 0" than 5' 11"; and so on. Finally, as
you move toward the very extreme positive end of the
distribution (say, for example, heights greater than 7'),
frequencies will be extremely low. Since the curve is
symmetrical, the same thing happens when you move away from the
mean in the negative direction: More men will be measured
at 5' 8" than at 5' 7"; more at 5' 5" than at 5' 3"; and
extremely few at less than 5'. The curve should be symmetrical,
which means that half of those measured will be above the
mean, whereas half will be below the mean. Even more
specifically, there will be the same frequencies of scores
at positive and negative points equidistant from the mean:
that is, f's should be equal for 5' 11" (+2" from mean)
and 5' 7" (-2" from mean); for 6' and 5' 6"; for 6' 1" and
5' 5"; etc.







POSTTEST (answers in back)

1. Describe the normal curve.

2a. Make a sketch of the normal curve.

2b. Suppose that the above diagram (assuming you drew it
    correctly) represents the distribution for the number
    of hamburgers eaten in a contest held on a college
    campus. In other words, when the scores for all
    participants, represented as the number of hamburgers
    eaten by each, were tabulated and plotted, the result
    was a normal distribution of scores. The mean of the
    distribution was 10. Assuming an exact normal
    distribution, identify each of the following statements
    as "true," "false," or "can't answer."

    1. The mean score of 10 would be positioned on the
       "X" axis directly below the highest point on the
       curve.

    2. There would be more scores above 10 than below 10.

    3. There would be more scores below 10 than above 10.

    4. The probability of finding a student who consumed
       13 hamburgers would be greater than finding one who
       consumed 10 hamburgers.

    5. A score of 15 should be equally common as one of 5.

    6. The mode of the distribution should be 10.

    7. The median should be about 11.

    8. The probability of eating 8 cheeseburgers would be
       greater than that of eating 7 cheeseburgers.



4.5 Raw score and z score scales in Normal Distribution

Normally, on the baseline of a frequency polygon, you
have a raw score scale. These scores may be heights, IQ's,
learning rates, etc. On this scale we would typically place
points that at least represent X̄ and several σ's on either
side of X̄. In addition to the raw score scale, it may be
necessary to label some other transformed or converted scale.
The most frequently used transformed scale is the z score
scale. Given a normally distributed set of data with X̄ = 70
and σ = 6, the raw score and z score scales would look
something like the diagram below.



Raw Score     52    58    64    70    76    82    88
z Score       -3    -2    -1     0     1     2     3



Notice that three σ units have been marked off on either
side of X̄; while this doesn't take in every possible score,
it is rare for any distribution to spread out more than 3σ
around X̄. Also note that the σ units have been evenly spaced
on either side of X̄ so that 1σ represents the same distance
at all places along the baseline.
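The paired scales can be generated for any mean and standard deviation with a short sketch like the one below. The function name `baseline_scale` is our own, and the choice to mark three σ units on each side simply follows the diagram.

```python
def baseline_scale(mean, sd, span=3):
    """Return (z, raw score) pairs for whole-number z values
    from -span to +span, evenly spaced one sd apart."""
    return [(z, mean + z * sd) for z in range(-span, span + 1)]


# The diagram above: mean = 70, sd = 6
for z, raw in baseline_scale(70, 6):
    print(f"z = {z:+d}    raw score = {raw}")
```

The same function with a mean of 30 and a standard deviation of 4 would let you check your own labeled sketch in the POSTTEST for this unit.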



Final Note:

The normal curve is not an ideal curve but rather a
curve that seems to accurately describe many human
characteristics.






POSTTEST (answers in back)

1. Make a sketch of the normal curve in the space below and
   label, as accurately as possible, a raw score scale and z
   score scale where X̄ = 30 and σ = 4.

   Raw Score
   z Score

2a. Assume that IQ scores are normally distributed in Mrs.
    Gordan's 12th grade class. If we make a plot of this
    distribution, will the heights of the curve at +1σ and
    -1σ be equal or different? Explain.

2b. Assume that X̄ = 98 in Mrs. Gordan's class, and give
    answers to the following:

    1. Over which score point will the height of the
       curve be greatest?

    2. Which score should be more common, a 106 or a 90?

    3. Is a score of 50 possible? Why or why not?



Instructional Unit: Percentages of scores between
                    standard deviations

One useful property of the normal curve is that a constant
percentage of the total area (assume total area = 100%)
will fall between fixed z scores on the baseline.
That is, a certain constant percentage will fall between
z = 0 (i.e., X̄) and z = 1, or between z = -1 and z = -2, and
so on.

Regardless of the raw score mean and standard deviation
of the set of normally distributed data, the percentage of
the total area between two fixed z scores will be the same.






The exact area between any two z scores can be found using
a normal curve table, as will be discussed in Units 4.7 - 4.10.
For present purposes, though, we will diagram the percentages
between whole number z scores. These values are often helpful
to memorize, as they provide a ready basis for estimating
percentages of scores in any normally distributed group. For
percentage areas between fractional z scores, use the normal
curve table (we'll worry about this later).




z scores       area of curve      rounded off area
                                  (memorize these)

-3 to -2           2.14%                 2%
-2 to -1          13.59%                14%
-1 to  0          34.13%                34%
 0 to +1          34.13%                34%
+1 to +2          13.59%                14%
+2 to +3           2.14%                 2%

Total             99.72% (approximately 100%)



Note:

The reason that the total is not exactly 100% is that,
technically, the curve does not stop at the z scores of
-3.00 and +3.00. Actually, the curve goes to "+" or "-"
infinity. However, only .28 of 1% fall outside of z scores of
-3 and +3; therefore, not much information can be gained by
extending the curve beyond three standard deviations in
either direction.
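If you are curious where these percentages come from, they can be recomputed rather than taken on faith. The sketch below is not the method used in this text (which relies on a printed table); it swaps in the error function from Python's standard `math` module, which gives the cumulative area under the normal curve. The function names are our own.

```python
import math


def cumulative_area(z):
    """Proportion of the normal curve lying below a given z score."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))


def area_between(z1, z2):
    """Proportion of the normal curve between two z scores."""
    return abs(cumulative_area(z2) - cumulative_area(z1))


# Percentages between whole-number z scores, as in the table above
for low in range(-3, 3):
    pct = 100 * area_between(low, low + 1)
    print(f"{low:+d} to {low + 1:+d}: {pct:.2f}%")

# Percentage falling outside z = -3 and z = +3
print(f"outside: {100 * (1 - area_between(-3, 3)):.2f}%")
```

The six band percentages agree with the table. The leftover area outside three standard deviations comes out a shade under the text's figure of .28 of 1%, since the text's figure is what remains after subtracting the sum of the rounded table values from 100%.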

If you think you understand the material on areas of the 
normal curve, see If you can answer the following questions: 






a) You gave a math test, the results of which were
   normally distributed. What percentage of students who
   took the test scored between 0 and +1σ from the mean?

   Answer: 34% of the students fall into that range. Why?
   Anytime data is normally distributed approximately 34%
   of the distribution will be between a z score of 0 (that
   is, the mean) and a z score of +1.

b) On the same math test, what proportion of the
   students scored between z = -1.00 and z = +1.00?

   Answer: 68%; anytime data is normally distributed, 68%
   of the scores will fall within "+" or "-" 1.00. 34% are
   between z = -1 and z = 0, and 34% are between z = 0 and
   z = +1. 34 + 34 = 68. Technically, the answer is 68.26%
   (34.13 + 34.13).



4.6 POSTTEST (answers can be checked on preceding page)

On the normal curve below, label the approximate
percentage values between the vertical lines. (Do this
without looking on the previous page; otherwise you will
merely be demonstrating that you have mastered the art of
copying numbers.)

   z:    -3    -2    -1     0    +1    +2    +3






Instructional Unit: Finding percentage of scores between
                    any two z's by use of "normal curve"
                    table

As the title of this unit suggests, we will be working
with something called a "normal curve" table, or more
specifically, "area under the normal curve" table.
Coincidentally, one is provided in Table I in the Appendix. That
is the table to work from once you are ready for the actual
problems. First, however, we'll present an abbreviated
version to show you how the table is used.

In the preceding unit, you memorized some percentages
which indicated the proportion of scores, in a normal
distribution, between whole number z values such as 0 to +1;
+1 to +2; -2 to -3; etc. Presumably that did not overly
tax your memory. You probably would object quite a bit,
however, if you were asked to memorize the proportions
for all possible combinations of z values; for example,
-2.78 to -1.03, etc. Thus, to find the percentage of scores
between any two z values, we make use of an "area under the
normal curve" table. If you can learn how to use the table,
you can easily determine the percentage of scores between
any two z values.

The table specifies the area or percentage of the total
normal curve between X̄ (for which z always equals 0) and
some other z value (whichever is of interest to you). The
table contains two types of information: a) z values,
which are located in the first column on the left (along with
hundredths values at the top); and b) the percentage of scores
that fall from X̄ (z = 0) to other z values; these are the
figures in the main body of the table. The general outline of
the table is presented below. (It would probably be valuable
to also take a look now at the full picture in Appendix Table I.)

          Area Under Normal Curve Table

                    z (hundredths)

  z        .00       .01      . . .     .09

 0.0      .0000     .0040     . . .    .0359
 0.1      .0398     .0438     . . .    .0753
  .
  .
 1.0      .3413     .3438     . . .    .3621
 1.1      .3643     .3665     . . .    .3830
  .
  .
 3.0      .4987     .4987     . . .



By use of the table we can find the percentage of scores
that fall between any two specified z values. Keep in mind
that the figures shown on the table represent the area from
z = 0 to any specified z value. For example, suppose you are
interested in determining the percentage of scores in a normal
distribution that fall between the mean (z = 0) and 1.5
standard deviations above the mean (z = 1.5). (You will need
a complete table to follow this example.) All you need to do
is locate z = 1.5 on the table; since no hundredths are
involved (as would be the case if z = 1.51, for example), move
down the vertical column until you reach z = 1.5. Immediately
adjacent to the z value, 1.5 (in the column corresponding to
"no hundredths" - .00), is the area value, .4332. Find this
value now, otherwise you will be totally lost (if you aren't
already). Move the decimal place on the area value two
places to the right and you will end up with 43.32. This
means that 43.32% of the scores in a normal distribution fall
between z = 0 (the mean) and z = 1.50. Notice on the table
that the listed z values range from 0.0 to beyond 3.00. Since
the normal curve is symmetrical and has 50% of the area on
each side of X̄, the normal curve table only has to account
for 1/2 or 50% of the area because the other half is identical.
Keep in mind that the z values in the table can be either
"+" or "-".

The above description may seem pretty heavy, so let's
backtrack a bit. Actually all that was said is as follows:

(1) locate the z value you are interested in on "the
    area under the normal curve" table. To do this
    you move down the vertical column (indicating
    whole number z's and tenths) and across the first
    row (indicating hundredths).

(2) the 4-digit numbers you find indicate the
    proportion of scores (or area) in a normal distribution
    between z = 0 and "your" z - the z score you
    located.
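The two steps above describe a lookup in a printed table. As a rough stand-in for readers without the Appendix at hand, the sketch below computes the same 4-digit entries directly. Be aware that this is a substitute technique, not the text's own: it uses the error function from Python's `math` module, and the function name `table_area` is hypothetical.

```python
import math


def table_area(z):
    """Area between the mean (z = 0) and z, rounded to 4 decimals
    like an entry in an 'area under the normal curve' table.
    The sign of z is ignored because the curve is symmetrical."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)


# The worked example: z = 1.5, in the "no hundredths" column
area = table_area(1.5)
print(area)                     # the table entry
print(f"{100 * area:.2f}%")     # decimal moved two places to the right

# A few entries from the abbreviated table above
for z in (0.01, 0.09, 1.00, 1.19, 3.00):
    print(z, table_area(z))
```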

The remaining sections of this unit will present some
example problems intended to illustrate different prototype
applications of the "normal curve area" table. These basic
prototypes will also comprise the focus of the three units
that follow. They consist of:

A. Finding the area between z = 0 and some
   specified ("+" or "-") z score.

B. Finding the area between a "+" and a "-" z
   score.

C. Finding the area between two "+'s" or between
   two "-'s".

D. Finding the area above or below a "+" z score.

E. Finding the area above or below a "-" z score.






Given that these types of problems can apply where z scores
are given, or where raw scores are given which must be
converted to z's, there would seem to be at least 10
different applications to learn (or memorize). Take heart,
this does not have to be the case at all. From this point
onward, the remainder of material all relates to a
limited number of fairly simple concepts. If you understand
the concepts (which mostly involve being comfortable with
the "table" and "normal curve" diagrams), you should be able
to solve any of the prototype problems presented, without
having to memorize the specific steps required for each.
It is all a logical process. So, "meaningful learners"
have a (fairly easy) problem-solving exercise to look
forward to. Rote memorizers, however, beware! You have a
long road to travel, filled with the usual pitfalls of
having to rehearse things that you don't understand, and
then confusing the steps memorized for different problems,
or forgetting them completely, at the time correct
procedures need to be applied. How will you know if you are one
of the respected and revered meaningful learners or one of
the dastardly rote learners? There is no absolute criterion,
but it would seem that you are probably a meaningful learner
if, given an area problem, you can represent what it asks in
a mental image or, much better, in an actual diagram of the
normal curve! You are probably a rote learner if you can
derive the correct answers to problems, but cannot represent
what you are doing in picture form. Beware the night
of the full moon, as meaningful learners have been known to
turn into rote learners on such nights. The early danger
signs are rehearsing things to yourself at meal times and
writing formulas down on little pieces of paper which you
put in pockets of different shirts. (The recommended cure
for this condition, sometimes called "rotewolvery," is to
eat cloves of garlic following each recital; if nothing else,
it will keep your statistics instructor away.)

Prototype A: What is the proportion of scores in a normal
distribution between the mean and z = 0.52?

First, you should draw a rough sketch of the normal
curve and pencil in the area you are asked to measure:

[Diagram: normal curve with the area between z = 0 and
z = 0.52 shaded]







Note that z = .52 is positioned a short distance above z =
0.00, since it is a positive value. Exact placement is not
necessary for these diagrams. Also note that the area
between the two z's is shaded, as this is what the problem
asks you to find. Then you simply locate on the "normal
curve" table the specific z value (z = 0.52) and record the
four-digit area value.

Hopefully, you agree that for z = 0.52, the area value
is .1985. We move the decimal place two digits to the right
and end up with 19.85 percent. (If you were unable to
locate .1985 for z = 0.52, seek help immediately; you must be
totally comfortable with the "normal curve" table in order
to proceed further with any chance of success.) In any case,
the answer to this problem is that 19.85% of the scores in a
normal distribution are between z = 0 and z = 0.52.

Another Example: What proportion of scores in a normal distribution
fall between the mean and z = -1.89?

The fact that we are working with a negative z score
makes no difference since the normal curve is symmetrical.
The area between the mean and z = -1.89 is identical to the
area between the mean and z = +1.89. Thus, we use the table
in the exact same manner for both positive and negative z
scores.

First, make a sketch of the normal curve and pencil in
the area in question, i.e., the area between z = 0 and z =
-1.89 (note that z = -1.89 was positioned "some distance"
below the mean, since it is negative).



[Diagram: normal curve with the area between z = -1.89 and
z = 0 shaded]



Examining the "table," we find that the proportion listed
for z = 1.89 is .4706. Thus, we can conclude that 47.06%
of the scores fall between the mean and z = -1.89.
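
The symmetry argument for Prototype A can be checked with a short Python sketch. It is an illustration only: the function name table_area is hypothetical, and the printed table is assumed to match values computed from math.erf and rounded to four digits.

```python
import math

def table_area(z):
    """Area between the mean and z. The sign of z is ignored,
    because the normal curve is symmetrical."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

for z in (0.52, -1.89, 1.89):
    area = table_area(z)
    print(f"z = {z:+.2f}: area = {area:.4f} ({area * 100:.2f}%)")
```

Note that the sketch never treats a negative z differently; it simply looks up the same value as for the matching positive z.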






Now, let's finish this instructional unit by examining
four more prototypes. Each prototype relates to a specific
and unique type of "area" problem. If you try to follow
their logic, they all should be fairly easy.

Prototype B: Finding percentage of scores between positive and
negative z scores.

What is the percentage of scores between z = -.19 and
z = +3.02?

Before giving up, make a sketch of the normal curve and
pencil in the approximate area you are trying to find.




[Diagram: normal curve with Area A shaded from z = -.19 to
the mean and Area B shaded from the mean to z = 3.02]



First place the z = -.19 somewhere below the mean, and the
z = +3.02 somewhere above the mean. Then reread the problem
again. Note that it asks for the area between these scores.
Perhaps by looking at the diagram you can gain some insight
as to the correct procedure.

The diagram shows that we need to go .19 standard
deviations below X̄ to get Area "A"; according to the
normal curve table, the area for z = -.19 is 7.53%. In
addition, we need to go 3.02 standard deviations above X̄
to get Area "B"; the table shows this area to be 49.87%.

Since our interest is in the total area between z = -.19
and z = 3.02 (that is, Sections A & B), we add the two:

7.53% + 49.87% = 57.40%

Our answer: 57.40% of the scores in a normal distribution
fall between z = -.19 and z = 3.02.

Note: With the diagram, this type of problem should be easy;
without the diagram it could become confusing. Always
use a diagram!
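
As a check on Prototype B, here is a small Python sketch that adds the two tabled areas. The names are hypothetical, and the table is assumed to be reproducible from math.erf rounded to four digits, so treat it as an illustration rather than a replacement for the diagram.

```python
import math

def table_area(z):
    """Tabled area between the mean and z (sign ignored)."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

def area_between_opposite_sides(z_negative, z_positive):
    """Area between a negative z and a positive z: the two
    sections sit on opposite sides of the mean, so we add them."""
    return table_area(z_negative) + table_area(z_positive)

area_a = table_area(-0.19)   # Section A, below the mean
area_b = table_area(3.02)    # Section B, above the mean
total = area_between_opposite_sides(-0.19, 3.02)
print(f"A = {area_a * 100:.2f}%, B = {area_b * 100:.2f}%, "
      f"A + B = {total * 100:.2f}%")
```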






Prototype C: Finding percentage between two positive z scores,
or between two negative z scores.

What is the percentage of scores that fall between z =
.19 and z = 1.12? Make a sketch of the normal curve, placing
both z values on the positive end, but the +1.12 somewhat
higher. Then pencil in the area of interest.




[Diagram: normal curve with Section A from the mean to
z = .19 and Section B (shaded) from z = .19 to z = 1.12]

To find the appropriate area, first look up the area for
z = .19. This tells you that 7.53% of the scores range
from X̄ to z = .19 (or area in Section A). Then look up the
area for z = 1.12, which is 36.86%; this represents the area
from the mean to z = 1.12 (or areas in Sections A and B). To
solve the problem you must get the value for Section B alone.
So, you subtract: 36.86 - 7.53 = 29.33% (A + B - A = B).

Your answer: appropriate area = 29.33%.

For this particular problem, what we had to do was find
the shaded area (B) by treating it as the remainder of the
whole (A + B) minus the part (A). It's all a logical
process, and there is never any need to panic. Just draw the
curve, fill in the givens, reread what the problem asks, and
then attack it step-by-step. As you locate a particular area
from the table, it's usually a good idea to mark it in the
appropriate place on your diagram. That way, your diagram
will continually present an up-to-date picture of where you
stand on the problem. For the problem we just solved, the
areas would be marked in the manner shown in the next diagram.




[Diagram: normal curve with 7.53% marked between the mean
and z = 0.19, and 29.33% marked between z = 0.19 and z = 1.12]



Another Example: Finding percentage between two negative z
scores.

What is the percentage of scores that fall between z =
-3.02 and z = -1.09?

Make your sketch, being sure to position the -3.02 somewhat
lower on the scale than the -1.09. Then shade in the area
of interest, i.e., the area between the two z scores.









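[Diagram: normal curve with Section B (shaded) between
z = -3.02 and z = -1.09, and Section A between z = -1.09 and
the mean]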

Using the table we find that from 0 to -1.09 (look up 1.09),
we have 36.21%. This is Section A. From 0 to -3.02 (look
up 3.02), we have 49.87%. This accounts for Sections A and
B. To find Section B alone: 49.87% - 36.21% = 13.66%
(A + B - A = B).

Our answer: 13.66% of scores fall between z = -1.09 and z =
-3.02. The procedure is exactly the same as for the positive
z scores in the preceding example.
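
The "whole minus part" logic of Prototype C can be sketched in Python as follows. The function names are hypothetical, and the tabled areas are assumed to match math.erf values rounded to four digits.

```python
import math

def table_area(z):
    """Tabled area between the mean and z (sign ignored)."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

def area_between_same_side(z1, z2):
    """Area between two z scores on the same side of the mean:
    the larger tabled area (A + B) minus the smaller one (A)."""
    return abs(table_area(z1) - table_area(z2))

for z1, z2 in ((0.19, 1.12), (-3.02, -1.09)):
    section_b = area_between_same_side(z1, z2)
    print(f"between z = {z1} and z = {z2}: {section_b * 100:.2f}%")
```

The same function serves both examples, because for two negative z scores you look up the matching positive values and subtract in exactly the same way.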






Prototype D: Finding the area above or below a +z.

What percentage of scores in a normal distribution would
be above z = +.87?

First, draw the normal curve sketch, being sure to place
the z of .87 somewhere above the mean. Reread the problem;
what area is it asking for? You are right if you're
thinking the area above the z score. Shade that area in as shown
below.





[Diagram: normal curve with Section A between the mean and
z = .87, and Section B (shaded) above z = .87]



Now the task is to find the percentage that corresponds
to the shaded area only. We look up the area for .87
and find it to be 30.78%. Is this the answer? If you say
"yes," you are not paying attention to the problem, because
the 30.78% belongs between z = 0 and z = .87. We are looking
for the area beyond .87z. How do we proceed???

If you are stumped, here is a clue (actually a prompt):
How much area is above the mean in every normal curve that
ever existed?

Answer: 50% above; 50% below.

Thus, the solution to the problem is: 50 (A + B) - 30.78 (A) =
19.22% (B). You are taking the part (A) away from the whole
(A + B) to find the remainder (B).

Another Example: What percentage of scores would be below
z = 1.28?

First, make your sketch, placing the 1.28 somewhere
above the mean. Reread the problem and note that it is
asking for the percentage below the z score. Shade in that
area as shown:

[Diagram: normal curve with Area A (everything below the
mean) and Area B (mean to z = 1.28) shaded]







How do we proceed? The only logical place to start is
with looking up the area for 1.28z, which turns out to be
39.97%. Is this the answer? NO!!! It is only part of it,
as our mission is to find the total percentage below 1.28z.
What percent is below z = 0.00? Fifty percent, as always,
is below the mean. Thus, to get the total below we simply
add 39.97 (area B) and 50.00 (area A) to get 89.97%.

Prototype E: Finding the area above or below a -z.

What percentage of scores would be above z = -2.00?
Here, we are confronted with simply a mirror image of the
"+z" problems. The first step, as usual, is to make a
sketch and shade in the area of interest. This time the
area is the total portion above -2.00z, as shown.

[Diagram: normal curve with Area A (z = -2.00 to the mean)
and Area B (everything above the mean) shaded]




We look up the area for -2.00z, and find that it is 47.72%.
The important thing to note is that it represents area A.
Area B, of course, is 50%. Thus, the total percentage
above -2z is 47.72 (A) + 50 (B) = 97.72% (A + B).






Last Example Before Posttest

What percentage of scores are below -0.52z in a normal
distribution? Diagram looks like:

[Diagram: normal curve with Area A (shaded) below z = -.52
and Area B between z = -.52 and the mean]

The shaded area reminds us that we are looking for an area
below the z score. The tabled value for -.52z is 19.85%,
and on the diagram it represents Area B. The total area
below the mean is 50% (Area A + B). Therefore, we proceed
as follows: 50 (A + B) - 19.85 (B) = 30.15% (A). We are
done!!!
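
Prototypes D and E both come down to combining a tabled area with the 50% that lies on each side of the mean. The Python sketch below is one way to express that; the names area_below and area_above are hypothetical, and the tabled values are assumed to equal math.erf results rounded to four digits.

```python
import math

def table_area(z):
    """Tabled area between the mean and z (sign ignored)."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

def area_below(z):
    """Total area below z: add the tabled area to 50% for a
    positive z, subtract it from 50% for a negative z."""
    if z >= 0:
        return 0.5 + table_area(z)
    return 0.5 - table_area(z)

def area_above(z):
    """Total area above z: whatever is not below it."""
    return 1.0 - area_below(z)

print(f"above +0.87: {area_above(0.87) * 100:.2f}%")
print(f"below +1.28: {area_below(1.28) * 100:.2f}%")
print(f"above -2.00: {area_above(-2.00) * 100:.2f}%")
print(f"below -0.52: {area_below(-0.52) * 100:.2f}%")
```

A quick sanity check in the spirit of the diagrams: any area "below" a negative z must come out under 50%, and any area "above" a negative z must come out over 50%.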



Final Notes:

1. Do not attempt to memorize the five prototypes. Try to
   understand them, and to see how they all really involve
   the same concepts.

2. Be sure you can make a sketch of any of the prototypes.
   Artistry is not important unless you want to use your
   normal curve sketches for your bulletin board or
   portfolio.

3. It should be pointed out that the above exercises involve
   the maximum work ever needed to solve area problems.
   Some more elaborate normal curve tables not only list the
   areas between the mean and the z scores, but also provide
   the total areas both above and below each of the z's.
   This information would cut short some of the solution
   steps we went through above. But, if you know the full
   procedure, you are naturally prepared to work with any
   normal curve table. A little more work at this stage
   could be valuable later.






4.7 POSTTEST (answers in back)

Assuming a normal curve, find the areas specified by
each of the following z score boundaries, and represent
each problem in a normal curve sketch.

1. between z = 0.00 and z = 2.00

2. between z = -.90 and z = -.20

3. between z = -2.20 and z = .70

4. between z = 0.00 and z = -2.50

5. between z = .70 and z = 2.10

6. above z = .45

7. below z = -1.12

8. above z = -.70

9. below z = +.98

10. (BONUS) above z = 1.96 OR below z = -1.96

4.8 Instructional Unit: Finding area between any two raw
scores in a normal distribution

This is one of those instructional units that really
does not offer anything new; it merely asks you to think
again about concepts that presumably have been mastered
in previous sections. To get through this instructional
unit with ease, you need to know: a) how to convert raw
scores to z scores given the mean and standard deviation
of the distribution (instructional unit 4.1), and b) how
to find the area between any two z scores in a normal
distribution (instructional unit 4.7).

In other words, this unit is going to be very similar
to the preceding unit. However, instead of being given
two z scores, you will be given two raw scores. Your task
will be to find the area between the two raw scores. If
you remember how to convert the raw scores into z scores
(if you do not remember you should not be this far ahead
in the unit), then you end up with exactly the same kind
of problem you attempted to solve in instructional unit
4.7. It should be great fun.

O.K., let's get through this quickly. To find the area
between two raw scores in a normal distribution, first
convert the raw scores to z values and then use the "area
under the normal curve" table to find the area between the
two resultant z scores. To assist in this process, it is
again essential to make a simple sketch of the normal curve.
In the sketch, include a raw score scale with
corresponding z values. Then pencil in the area you are
trying to measure.




Example Problem: What is the area (percentage of scores)
between the raw scores of 38 and 44 in a
distribution where X̄ = 40 and σ = 2?

Make your diagram:

[Diagram: normal curve with a raw score scale (38, 40, 44)
and a matching z score scale; the area between 38 and 44 is
shaded]

Convert the raw scores to z scores and include a z score
scale on the diagram.

For 38: z = (38 - 40)/2 = -2/2 = -1.00

For 44: z = (44 - 40)/2 = 4/2 = +2.00

You will note that this is a problem in which you must
find the area between a positive z score and a negative z
score. Thus, you find the percentages in the "normal curve"
table for each z score and add them together (A + B = A + B).

The area between X̄ and z = -1 is 34.13%. Between X̄ and
z = 2, you get 47.72%. Therefore, between z = -1 and z = 2,
or raw scores of 38 and 44, you have 34.13% + 47.72% = 81.85%.

Answer: 81.85% fall between scores of 38 and 44 on this
particular distribution.

Example Problem: What is the area between raw scores of 45 and
41 in a distribution where X̄ = 40 and σ = 2?

Make your diagram:

[Diagram: normal curve with a raw score scale (40, 41, 45)
and a matching z score scale; Section A runs from the mean to
41 and Section B (shaded) from 41 to 45]

Convert the raw scores to z scores and include a z score
scale on the diagram.

For raw score of 41: z = (41 - 40)/2 = 1/2 = .5

For raw score of 45: z = (45 - 40)/2 = 5/2 = 2.5

Note that this is a problem in which you must find the
area between two positive z scores. Thus, find the
percentage values for each and subtract the Part (A) from the
Whole (A + B) to get the missing Part (B).

Between X̄ and z = .5 you get 19.15% (in Section A).

Between X̄ and z = 2.5 you get 49.38% (in Sections A & B).

For Section B alone:

49.38% - 19.15% = 30.23% (A + B - A = B).

Our answer: 30.23% of the scores fall between the raw
scores of 41 and 45 in this particular distribution. Keep
in mind that 30.23% of the scores do not always fall
between raw scores of 41 and 45. They do in the above
example, however, because the corresponding z scores, +.5
and +2.50, happen to include that area. Think about it.
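
Both raw score examples can be run through one short Python sketch. It is offered as an illustration under stated assumptions: the function names are hypothetical, z scores are rounded to two places so they can be "looked up," and the table is assumed to match math.erf values rounded to four digits. Instead of deciding by hand whether to add or subtract, the sketch subtracts the total area below the lower score from the total area below the higher score, which gives the same result in both situations.

```python
import math

def table_area(z):
    """Tabled area between the mean and z (sign ignored)."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

def z_score(raw, mean, sd):
    """Convert a raw score to a z score, rounded to hundredths."""
    return round((raw - mean) / sd, 2)

def area_below(z):
    """Total area below z, built from the tabled area and 50%."""
    return 0.5 + table_area(z) if z >= 0 else 0.5 - table_area(z)

def area_between_raw(low, high, mean, sd):
    """Area between two raw scores in a normal distribution."""
    return area_below(z_score(high, mean, sd)) - area_below(z_score(low, mean, sd))

for low, high in ((38, 44), (41, 45)):
    area = area_between_raw(low, high, mean=40, sd=2)
    print(f"between {low} and {high}: {area * 100:.2f}%")
```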

Final Notes:

1. Always sketch out the problem. If you cannot, you should
   return to Unit 4.7 and review the procedure for the
   diagrams.

2. Given raw scores, first convert to z scores and then
   find the area between the two z values.



4.8 POSTTEST (answers in back)

For each of the sets of data below, find the percentage
of scores specified by the listed raw score boundaries. Make
a diagram for each.

1. Assume X̄ = 72 and σ = 4. Find the percentage of scores:

   a. between 66 and 71
   b. between 72 and 80
   c. below 68

2. Assume X̄ = 500 and σ = 100. Find the area:

   a. between 450 and 625
   b. above 480
   c. above 530

3. Assume X̄ = 22.1 and σ = 3.4. Find the area:

   a. below 21.3
   b. between 22.0 and 24.0
   c. between 20.3 and 21.9

4.9 Instructional Unit: Converting z scores into percentile
ranks, and percentile ranks into z scores

As you found out in Unit II, a percentile rank is the
percentage of scores (of total N) that fall below some point
in the distribution. If Fred's percentile rank is P34, then
we can say that 34% of the scores in the distribution fall
below Fred's. No problem.

Also in Unit II, you learned to use the interpolation
equation to compute percentile ranks... and if you are like
most people, you probably disliked doing that immensely.
Well, here is some good news: if you know the person's z
score in a normal distribution, you can rather quickly
compute his/her percentile rank by use of the "area under
the normal curve" table. As a matter of fact, you have
already had experience with this type of problem in the past
two units.

Any time you have been given an area problem that asked
you to find "the percentage of scores below ...," you were
really being asked to determine a percentile rank. Maybe
you realized this, but probably, with your attention
directed to the different normal curve problems and the making
of beautiful diagrams, you did not. To capitalize on your
past experiences (assuming you do complete the posttests or
at least stare at them for a while), turn back to Posttest
4.7 and look at problems 7 and 9. Both ask you to find the
area below a specified z score, a "-" one in #7 and a "+"
one in #9. You have come a long way in your study if you
can see that both problems really ask you to specify the
percentile ranks of the given z's, i.e., the percentage of
scores that would be positioned lower than they. To get a
feeling of progression, now turn forward to Posttest 4.8 and
examine problems 1c and 3a. It's the same story; both ask
you to find the percentage of scores below the given (raw)
scores; both ask, in essence, for the specification of the
percentile ranks for those scores. It should be obvious
to you that the percentile rank for a z score or raw score
positioned below the mean must be less than 50 (P50); for a
z or raw score positioned above the mean, it must be greater
than P50. What this means is that if you calculate the
percentile rank of a z score equal to, say, +1.00 and
determine it to be something like P34, your answer has to be wrong
and, in fact, is downright embarrassing. In such an instance,
you would have forgotten to add 50% to the tabled value for
z = 1.00 of 34% to get a more decent looking P84. Let's
briefly review the procedures, although you should know them
quite well at this point.

For normal distributions, percentile rank problems are
identical to "area below" problems. To find a percentile
rank for a z score, you need to find the area below that z
score. To find a percentile rank for a raw score, you need
to: a) convert the raw score to a z score, and b) then find
the area below that z score. The system that we have been
using throughout this Unit, but especially on area problems,
can be represented as follows:

Score Indices                     Conversion Procedure

Raw Score (X)
      |                           Formula (unit 4.1)
Standard Score (z)
      |                           "Normal curve" table (unit 4.6)
Area or Percentage (% or P.R.)

What this shows is that any type of problem which we have
dealt with can be solved by either one or two conversions.
Changing an X to a z, or a z to an X, is a one-step process
involving use of the conversion formula. Changing a z to a %,
or a % to a z, is also a one-step process, but involves the
use of the table. Changing an X to a %, or a % to an X, is
a two-step process, involving "going to" z first and then
converting the z in the appropriate manner. All problem types
presented in this Unit are solved by the steps included in
this rather simple scheme.

For our first percentile rank problem, assume that we
are trying to find the P.R. for a student named Sidney who
has just completed a psychology exam. Sid, as he likes to
be called, was determined to have scored at z = 1.30. This
z score information is good news for us because it
eliminates the need for the "formula" (X-to-z conversion) step.
Only the table is needed. We make the following sketch,
this time incorporating a % rank scale:

[Diagram: normal curve with z and % rank scales; the entire
area below z = 1.30 is shaded]




To find the area below z = 1.30 and thus, the
corresponding P.R., we need to measure the entire shaded part of
the curve. The first half, by definition, encompasses 50%.
All we need, then, is the "unknown" area from z = 0 to
z = 1.30. The table, of course, comes to our aid by
listing 40.32% for that portion. What percentage of the class
has Sid beaten?

50 + 40.32 = 90.32%

Sid should be proud.

Another Example

In an international women's gymnastics tournament, the
average rating for participants on the balance beam is 9.20
with σ = .3. Natasha BobollncksKismlth, from lower
UrksAsas, received an 8.80. What is her P.R.? (Assume a
normal distribution.)

Looking at the information provided, this has to be one
of those two-step problems. To get from raw (actual) scores
to percentages, we must go to z first. To keep our sanity,
however, it would be a good idea to represent the problem
in diagram form.



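[Diagram: normal curve with raw scores 8.80 and 9.20 marked
and a z scale showing "?" under 8.80; the area between 8.80
and 9.20 is labeled 40.82%]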



The diagram shows that Natasha scored below the raw score
mean, and thus will receive a negative z (when one is
computed) and a P.R. somewhere below 50. The conversion is
the first step:

z = (8.80 - 9.20)/.3 = -.40/.3 = -1.33

We should list the -1.33 on our sketch in place of the "?"
in the z scale. Now we are ready for the table. We look
up the area for z = 1.33, and find that it is 40.82%. What
is Natasha's P.R.?

If you said "40.82" you should be sent back to the salt
mines (with Natasha). That figure represents the area
between -1.33z and 0. To get the area that is below -1.33z,
we need to subtract the 40.82 from the whole (50%). (If you
don't see this, check unit 4.7, Prototype E.) The P.R. for
Natasha is 50% - 40.82% = 9.18%. Natasha, apparently, will
soon be repairing cartwheels, rather than doing them, in
the Uksark mountains.
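
The one-step (z to %) and two-step (X to z to %) percentile rank problems above can be sketched in Python. The function names are hypothetical, z is rounded to hundredths to mimic a table lookup, and the table itself is assumed to match math.erf values rounded to four digits.

```python
import math

def table_area(z):
    """Tabled area between the mean and z (sign ignored)."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

def z_score(raw, mean, sd):
    """Formula step: convert a raw score to z, rounded to hundredths."""
    return round((raw - mean) / sd, 2)

def percentile_rank(z):
    """Table step: percentage of scores below z."""
    below = 0.5 + table_area(z) if z >= 0 else 0.5 - table_area(z)
    return round(below * 100, 2)

# Sid: z is given, so only the table step is needed.
print("Sid's P.R.:", percentile_rank(1.30))

# Natasha: raw score given, so formula first and then table.
z_natasha = z_score(8.80, mean=9.20, sd=0.3)
print("Natasha's z:", z_natasha)
print("Natasha's P.R.:", percentile_rank(z_natasha))
```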



Reversing the Process: % to z's

You can now determine the percentile rank equivalent for
a z score in a normal distribution. Based on this knowledge,
reversing the process to get a z from a P.R. should not be
very difficult. Using the kind of area table that we have,
there is only one "trick" involved, and if you can master
that, it will be smooth sailing thereafter. Let's see how
the reversal process works in an example.

Example

On a nationally given scholastic aptitude test, Tim is
told that he scored at P75. Assuming a normal distribution,
what would his z score be?



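Make a sketch, indicating the givens and unknowns.

[Diagram: normal curve with raw score, z, and % scales; the
% scale shows 50 at the mean and 75 at Tim's score, with the
raw score and z left as unknowns]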



The sketch shows us immediately that Tim's z score is
positive (P75 defines that). The question is how do we find
it. Well, if you remember the conversion scheme we talked
about a few pages back, you can probably identify this as a
one-step problem: a % to a z. As defined, it requires use
of the table only. We must look up an area to find z, but
here is the trick. Some people, who are not thinking, try
to look up 75% (.7500) on the table, because that is the
only area they have been given. We don't want a z score
that has 75% of the scores between it and the mean (besides,
there isn't such an animal). We want one that "beats" 75%.
Since our table works from z = 0 outward, we need to look
up the area between the mean and our P.R.: 75% - 50% = 25%.
Thus, the z score in question has 25% of the cases between
it and the mean. To find it, we scan the areas on the table
and identify the one closest to .2500 (25%). It turns out
to be .2486. The z that corresponds to 24.86% is .67. We
attach a "+" sign to it, since we are dealing with the
positive end of the curve, and express the answer as +.67z. It
is the only z score below which 75% of the cases fall.
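
The "trick" of working from the mean outward and then scanning for the closest tabled area can be sketched in Python. This is a hedged illustration: z_from_percentile is a hypothetical name, the scan covers z = 0.00 to 3.99 in steps of .01, and the table is assumed to match math.erf values rounded to four digits.

```python
import math

def table_area(z):
    """Tabled area between the mean and z (sign ignored)."""
    return round(0.5 * math.erf(abs(z) / math.sqrt(2)), 4)

def z_from_percentile(percentile):
    """Find the z score whose tabled area is closest to the
    distance between the percentile rank and the mean (P50)."""
    target = abs(percentile / 100 - 0.5)          # e.g. 75% - 50% = 25%
    candidates = [i / 100 for i in range(0, 400)]  # z = 0.00 ... 3.99
    z = min(candidates, key=lambda c: abs(table_area(c) - target))
    # Attach the sign: below P50 means the negative side of the curve.
    return -z if percentile < 50 else z

print("P75 ->", z_from_percentile(75))
print("P40 ->", z_from_percentile(40))
```

Notice that the sketch never looks up .7500 itself; it always subtracts from 50% first, which is exactly the step people forget.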

Another Example

On the same aptitude test as in the first example,
Sally's performance places her at P40. The overall test
mean is 500, with σ = 100. What was Sally's score?

This is one of those two-step problems, as it involves
going first from % to z, and then using z to obtain the
appropriate raw score. First we diagram, and then identify
z.




[Diagram: normal curve with a % scale showing 40 below the
mean and 50 at the mean]

The diagram reminds us that we will be working on the
negative side of the curve. We need to find a z that
surpasses 40% of the scores, but to look up .4000 on the table
would be an outlandish mistake. Since the table works from
the middle (z = 0), we need to define the z score in question
in terms of its distance from the middle. The lower half of
the curve encompasses 50% of the scores and "our" z score
surpasses 40%. Through simple subtraction, 50% - 40% = 10%,
it becomes clear that the distance between the z and the
mean is 10%. On the table, the z will be the one most
closely associated with the area value .1000. A scan of
the table values gives .0987 as the closest figure. Its
associated z is .25, which must be given a negative sign
due to P40 falling on the negative side.

Our answer: z = -.25 for P40.

We are almost, but not quite, finished. The problem
asks for Sally's score on the test, which moves us to the
second step: converting a z to a raw score. Now the highly
familiar conversion formula should do the trick:

X = zσ + X̄

Substituting the givens:

X = -.25(100) + 500

X = -25 + 500

X = 475
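
The second step of Sally's problem, converting a z back to a raw score, is a one-line formula. A minimal Python sketch follows; the function name raw_from_z is hypothetical.

```python
def raw_from_z(z, mean, sd):
    """Conversion formula: raw score = z times the standard
    deviation, plus the mean."""
    return z * sd + mean

sally_raw = raw_from_z(-0.25, mean=500, sd=100)
print("Sally's raw score:", sally_raw)
```

Because Sally's z is negative, the result has to land below the mean of 500; if it does not, a sign was lost somewhere.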

Bonus question: If 1000 people took the aptitude test,
how many did Sally beat?

The solution is much simpler than it might first appear.
Sally scored at P40. By definition, she scored higher than
40% of the people in her group. We get: 40% of 1000 = .40
x 1000 = 400 people.

Let's return to Tim from the previous example. You may
recall that he scored at P75. Suppose that when he took the
test, the group consisted of 11,978 students. How many did
he beat? We need to take 75% of 11,978: .75 x 11,978 =
8,983.50 people. Looking at the answer, you may be worried
about the half a person represented by the decimal figure,
and wonder whether he/she is very short. No need to worry;
a fractional person does not exist, but is almost certainly
a "creation" of round-off errors that occurred when Tim's
P.R. was computed originally. For accuracy in our present
computation, we should keep the ".5" in our answer.
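
The head counts for Sally and Tim can be confirmed with a tiny Python sketch. The function name people_beaten is hypothetical, and the fractional result for Tim is deliberately left unrounded, as the text recommends.

```python
def people_beaten(percentile_rank, group_size):
    """Number of people scoring below someone at the given
    percentile rank in a group of the given size."""
    return percentile_rank * group_size / 100

print("Sally beat:", people_beaten(40, 1000))
print("Tim beat:", people_beaten(75, 11978))
```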






4.9 POSTTEST (answers in back)

1. Convert the following z scores into percentile ranks:

   a. -2.40
   b. -1.20
   c. 0.00
   d. +0.90
   e. +2.30

2. Convert the following percentile ranks into z scores:

   a. 15
   b. 25
   c. 40
   d. 60
   e. 85

3. In a distribution where X̄ = 10 and σ = 3, what is the
   percentile rank of raw scores of:

   a. 14
   b. 5
   c. 9

4. In the same distribution as above, what raw scores
   correspond with:

^0 ' 

""so ' r. 

C. P43 ^ 



Recommendations for Unit IV:

1. Try to understand the logic to the area problems. If
   you can achieve this understanding, you will be able to
   solve any of the prototype problems, without having to
   memorize the specific solution steps for each. For
   example, if you can solve the problems presented in the
   preceding section, unit 4.9, you should be able to solve
   the problems presented in units 4.7 and 4.8, as they
   deal with the identical concepts.

2. Always make a normal curve sketch, even a very rough one,
   as it serves as a reminder of what the problem involves,
   the information provided, etc. It also gives reality to
   the problem, representing the data as related to a
   distribution of scores, not just to a series of fairly
   mechanical computations.






3. Remember that all of the problems that have been
   presented are solvable through a restricted set of
   operations. Here is a summary:

   Given                Find                 Use

   Raw Score (X)        Standard Score (z)   Formula
   Standard Score (z)   Raw Score (X)        Formula
   Standard Score (z)   Area (% or P.R.)     Table
   Area (% or P.R.)     Standard Score (z)   Table
   Raw Score (X)        Area (% or P.R.)     Formula to find z;
                                             then Table
   Area (% or P.R.)     Raw Score (X)        Table to find z;
                                             then Formula

4. Try not to regard the different problems as independent
   entities. A poor procedure for understanding the normal
   curve concept is to memorize operations such as "for
   finding the area between a +z and a -z I should add."
   If you can diagram the problem, the normal curve
   representation will remind you what to do.






Unit IV Review Test
(answers in back)

1. We give a language skills test and end up with a normal
   distribution with X̄ = 75 and σ = 8.

   a. What percentage of scores are above .34z?

   b. What percentage are below 2.29z (or what is
      its P.R.)?

   c. What percentage are between +.24z and -.24z?

   d. If 1000 students took the test, how many scored
      between -1z and -.5z?

   e. What percentage are between raw scores of 83
      and 87?

   f. John's score was 70. What is his P.R.?

   g. Sandra's score was 90. What percentage scored
      above her?

   h. Bob's P.R. was 25. What was his raw score?

   i. BONUS: What percentage scored between T = 35
      and T = 55?

2. Four students received the following scores on a test
   where X̄ = 500 and σ = 100. Assume a normal distribution,
   and find the percentile rank for each student.

   a. Mary = 575

   b. Ann = 440

   c. Sue = 620

   d. Joan = 780

   e. Barry = 500

   f. Ed = 510






UNIT V

CORRELATION

A. General Objectives

Up to now, we have concerned ourselves exclusively with the measure-
ment and description of single variables, such as scores on a test,
heights, weights, etc. Frequently, however, researchers in most sciences
are interested in testing hypotheses concerning the relation between two
variables for a given population of persons or things; for example, height
and weight, IQ and Grade Point Average, smoking and heart disease, exer-
cise and physical health, etc. The statistical tool through which such
relations can be objectively determined is called correlation. In this
unit we will introduce the concept of correlation and show how it may
be used by researchers to provide insight into relations among different
properties or characteristics that define a given population.

B. Specific Objectives

5.1 Define correlation

5.2 Describe the data collection procedures involved in
    conducting correlational studies

5.3 Construct and interpret scatterplots of correlational
    data

5.4 Define and interpret correlation coefficients

5.5 Compute correlations by means of different formulas

5.1 Instructional Unit: Definition of Correlation

In earlier units, we have dealt with variables (such as IQ, test 
scores, heights, weights) on an individual basis. Specifically, we 
have shown that through the assessment of such characteristics we 
can describe a given population in terms of several kinds of indices. 
For example, we might report that a random sample of high school seniors
in Vienna, Illinois has a mean IQ of 102, with a standard deviation of
14.5, and a range of 67 - 174. As will be described in later units,
we might then use this information to make inferences about the population
that this sample represents. Our results, however, would be
restricted to IQ scores, as would the conclusions we could then make.
Later, we may wish to describe and compare the two populations on the 
basis of numerous other individual variables. 

Such assessments would tell us something more about the particular 
population(s) in question. But they would not tell us whether the 
various traits meaningfully relate to one another and can be used as a 
basis for making predictions about the characteristics of individuals
in the population. We may know, for example, that students in a particular
school average 101.7 in IQ, and 508 on the verbal portion of
the SATs. Having this information, however, is not sufficient to answer
questions such as: "Are IQ and SAT scores significantly related for
this group?" "Will a student who is high on IQ tend to do well on his
SATs?" "How is Johnny, who has an IQ of 78, likely to perform on the
SATs?" Questions like these can be asked in reference to numerous
other variables: Is height related to weight? Is teacher salary
related to teaching effectiveness? Is noise level related to work
efficiency?

Such questions lead to the concept of correlation. In introducing
this concept, we might best begin with a definition.

Definition: Correlation is a technique used to determine the
relation between two variables.

Simple enough, but one might appropriately question what is meant
by the term variable. When we speak of variables, we are referring to
characteristics or events that are likely to differ from measurement to
measurement, i.e., from element to element in the population being
assessed. Thus, we can differentiate between variables and constants, 
the latter being a measure that remains the same across a series of 
assessments. We have been dealing with variables since Unit I when we 
constructed numerous frequency distributions of test scores. But just 
to make sure of your understanding of these concepts, try to identify 
the following measures as either variables or constants (cover the 
answers on the right) . 

a) IQ in a 2nd grade class: clearly a variable, since the IQ
scores of children will not all be the same.

b) number of inches in a population of yardsticks: a constant,
by definition. (Of course there may be trivial differences in
length, but for a yardstick to call itself a yardstick, it
would have to be 36 inches long.)

c) shoe size in a group of undergraduates (assume that all wear
shoes): a variable, since shoe sizes would differ among
individuals.

d) number of teeth in a population of parakeets: a constant;
parakeets do not have teeth and thus all scores would be zero.

e) number of digits in a group of social security numbers: a
constant; all scores would be 9.

f) number of face freckles in a population of adults: a variable;
obviously people differ in the number of freckles they have.
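
If you like to check such things by machine, the variable/constant distinction can be sketched in a few lines of Python. This is only an illustration: the function name `classify` and all of the scores are made up for the example and do not come from any real data set.

```python
def classify(scores):
    """Return 'constant' if every score is the same, otherwise 'variable'."""
    return "constant" if len(set(scores)) <= 1 else "variable"

# Hypothetical measurements, invented for illustration only.
yardstick_inches = [36, 36, 36, 36]
shoe_sizes = [7, 9.5, 8, 11, 9.5]
parakeet_teeth = [0, 0, 0]

for name, scores in [
    ("inches per yardstick", yardstick_inches),
    ("shoe size", shoe_sizes),
    ("teeth per parakeet", parakeet_teeth),
]:
    print(f"{name}: {classify(scores)}")
```

A measure that comes out as a constant cannot be correlated with anything, since it gives no basis for telling one element of the population from another.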



Now that we have illustrated what is meant by variable, let us
return to our definition of correlation: the degree of relation between
two variables. What does it mean if we say that two variables are related
in a given population?

One interpretation is that values on one variable can be used to
predict values on the other. Or, by knowing a person's standing on one
variable, say height, we can make a reasonable guess about his standing
on another, say weight. Or, people at certain levels of one variable
tend to be found at certain levels of another. If this is the case,
we can say that the two variables are correlated. Let's look at some
examples.

Example 1: Height and Weight. Would height and weight tend to be
correlated in a population of adults? The answer is "yes." Knowing
that a person is 7' tall allows us to be reasonably certain that he is
heavier than average (unless his nickname is "stringbean"). Thus,
height and weight are likely to be related, that is, correlated. As
height increases in a population so does weight, at least on the average.

Example 2: Toes and Hair Color. Would there be a correlation between
number of toes and hair color in a population of physically normal
adults? This one is designed to trap you, since number of toes would
not be a variable and thus could not be correlated. All "toe" scores
would, by definition, be 10. Thus, being told a person's toe score
would be absolutely useless in making a prediction about his hair color.

Example 3: IQ and GPA. Would there be a correlation between IQ
and Grade Point Average in a population of high school students? Most
likely there would: Knowing a student's IQ score would provide some
basis for predicting his high school average. On the average, students
with high IQs have high grades; students with low IQs have low grades.

Example 4: Length of Name and Salary. Finally, would there be a
correlation between the number of letters in people's names and the
amount of money they make? We could correlate the two, but would probably
obtain little support. That is, knowing that Alexander
Kowalskifreehoppin has 27 letters in his name provides little insight into his
probable income. Conversely, knowing a person's annual income would
provide little basis for making even an educated guess about the length
of his name.

Before going on to a more detailed analysis of what correlation
implies, we'll leave you with the following thought: You are likely to
find a correlation when scores on one variable seem to be associated in
some systematic way with scores on the other variable; e.g., height and
basketball ability; IQ and annual salary; daily temperature and the
amount of clothing worn, etc. Do not read further until you think you
understand why there would be a significant relation between these
variables.

One more thought: Variables can be correlated in two different ways:
positively or negatively. These types of correlations will be discussed
in considerable detail later on, but for now, you should at least try to
understand the following very general definitions:

1. We obtain a positive correlation when high scores on one variable
tend to be associated with high scores on the other variable, and
similarly, when low scores on one variable tend to be associated
with low scores on the other.






Can you see how this would apply in the case of:

- IQ and grades?
- height and weight?
- annual salary and size of car?
- love of statistics and the closeness of this book to your nightstand?

2. We obtain a negative (or inverse) correlation when high scores
on one variable are associated with low scores on another
variable, and vice versa.

Can you see how this would apply in the case of:

- absences and grades?
- annual salary and felony convictions?
- temperature and number of articles of clothing worn?

To focus on the last example of a negative correlation, isn't it true
that when temperatures are high (90, 95, 100 degrees) the number of
articles of clothing tends to be low and vice versa? Can you imagine
a person wearing only 1 article of clothing (a low value) when the
temperature outside is 5 degrees (a low value also)? (Okay, stop
imagining; it's time again to get back to work.)
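
For readers who want to see the two definitions at work, here is a minimal Python sketch. It assumes one common way of judging direction: multiply each person's deviation from the mean on X by his deviation from the mean on Y, and add up the products. The function names and every score below are hypothetical, made up purely for illustration.

```python
def cross_product_sum(xs, ys):
    """Sum of (x - mean of X) * (y - mean of Y) over all pairs of scores."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

def direction(xs, ys):
    """Label the relation by the sign of the summed cross-products."""
    total = cross_product_sum(xs, ys)
    if total > 0:
        return "positive"
    if total < 0:
        return "negative"
    return "none"

# Hypothetical scores, invented for illustration only.
temperature = [5, 30, 60, 75, 95]       # degrees outside
articles_worn = [9, 7, 5, 4, 2]         # articles of clothing
iq = [85, 95, 100, 110, 125]
grades = [1.8, 2.4, 2.3, 3.0, 3.7]

print("temperature and clothing:", direction(temperature, articles_worn))
print("IQ and grades:", direction(iq, grades))
```

When high scores go with high scores (and low with low), most of the products are positive; when high scores go with low scores, most are negative.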

POSTTEST (answers in back)

1. Define correlation:

2. Which of the following measures could be examined in a correlational
study (which are variables)?

a. number of digits in U.S. area codes

b. number of freckles in a group of 2nd graders

c. number of illnesses in 1975 for adults in same group

d. number of toes in a group of physically normal 3-toed sloths

e. number of friends possessed by individuals

3. Suppose we determined the Grade Point Averages of all seniors in a
particular high school. How would Grade Point Averages probably
relate to the following variables? Choose one of the following
in giving your answer:

a) no correlation is likely

b) a positive correlation is likely

c) a negative correlation is likely

1. height

2. number of absences from class

3. pulse rate

4. IQ

5. amount of TV watching

6. number of books read during past year

4. Of the following pairs of variables, which pair would be most
likely to correlate positively?

Pair 1: Age of automobile and miles per gallon.

Pair 2: Distance north of the equator and average daily
temperature for different cities.

Pair 3: Number of bars and number of schools in different
U.S. cities.

5.2 Instructional Unit: Conducting a Correlational Study:
Data Collection

Now that you have some understanding of what correlation implies,
we will turn to some of the basic procedures involved in conducting a
correlational study. The procedures are really quite simple, and,
believe it or not, once you've read through this material, and learned
it, you should be able to set up a correlational study of your own (wow!).
Before starting, you will be relieved to find out that it is not really
necessary to memorize the procedures that will be enumerated. What is
important is that you acquire a general understanding of the logic and
rationale involved. Here goes...

Step #1: Define the Problem

Before conducting a correlational study, you should identify the
problem or hypothesis with which you are concerned. Specifically,
you should clearly determine which variables you wish to correlate
and why.

In addition to defining the variables of interest, you should also
specify the type of population to which your hypothesis is directed.
Obviously, the relation between variables such as number of showers
(or baths) taken on a weekly basis and popularity might be very different
for a population of human beings and a population of baboons. For
all we know, the most popular baboons might be the ones who take the
least number of showers (thus, a negative correlation). We doubt,
however, that such would be the case for humans.

Enough about baboons and showers. For illustrative purposes, let's
consider a researcher who is interested in determining whether a significant
relation exists between IQ and grades for a population of
fifth graders in Tupelo, Mississippi. He might define the purpose of
his study in the following manner:




Purpose: to determine the relation between IQ and
grades for fifth grade students in Tupelo.

To make things interesting, this researcher (let's call him A)
actually has a hypothesis: IQ is positively related to grades.

Researcher B, who specializes in physical education, believes that
his college students who are overweight will experience difficulty in
performing gymnastics (specifically, tumbling). He might define the
purpose of his analysis as follows:

Purpose: to determine the relation between weight and
proficiency at tumbling for males taking
college physical education classes.



Step #2: Specify the Sample to be Measured

This really involves determining what may be thought of as the "unit
of analysis." That is, who or what will be used to provide measures on
the two variables to be correlated? Will you be measuring people,
chairs, baboons, farms, etc.?

For Researcher A, it is obvious that the "units" will be people,
specifically, 5th graders in Tupelo. Is it necessary for this researcher
to measure IQ and grades for every fifth-grade student in Tupelo? It is
not necessary, and it would probably take too long. It would be far
more practical to use a sample, say, of 50 students. The larger the
sample, the more likely the data will reflect actual, real-life conditions.

So, using a random selection procedure, Researcher A selects
50 students from the population of fifth graders in Tupelo. He is now
ready to begin testing.

Turning now to Researcher B: what is his unit of analysis? Given
his interests, the units would be college students. For each student
he can obtain a measure of weight and a measure (probably a rating) of
proficiency in tumbling. He can then try to determine whether the two
measures (variables) relate to one another, that is, whether they
correlate.

Since it would be too expensive to include every college student,
he decides to use a random sample of 25 students from which he hopes
he can generalize to the entire population.
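
The "random selection procedure" can be sketched with Python's standard `random` module. Please treat the details as assumptions: the text does not say how many fifth graders live in Tupelo or how many men take physical education, so the roster sizes (600 and 300), the student labels, and the seed are all made up for illustration.

```python
import random

# Hypothetical rosters; the sizes are arbitrary stand-ins, not real figures.
fifth_graders = [f"fifth_grader_{i:03d}" for i in range(600)]
pe_students = [f"pe_student_{i:03d}" for i in range(300)]

rng = random.Random(1982)  # fixed seed so the same draw can be repeated

# sample() draws without replacement, so nobody is picked twice.
sample_a = rng.sample(fifth_graders, 50)   # Researcher A: 50 students
sample_b = rng.sample(pe_students, 25)     # Researcher B: 25 students

print(len(sample_a), "students drawn for Study A")
print(len(sample_b), "students drawn for Study B")
```

Every member of the roster has the same chance of being drawn, which is what allows the researcher to hope that the sample speaks for the whole population.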

Step #3: Measure Every Member of the Sample on Both Variables

Correlation implies determining whether scores on one variable tend
to be associated in some systematic manner with scores on another variable.
Thus, we need to obtain two sets of scores from each element in the
sample being measured.

For Researcher A, this will involve measuring all 50 students on
both IQ and grades.

For Researcher B, this will involve measuring all 25 students on
both weight and tumbling skill. For the latter, he has each student
perform a sequence of exercises, and then rates each, with the help
of an observer, on a 1-10 scale.

Step #4: Tabulate the Resultant Data

This really involves listing and organizing the obtained scores in
order to facilitate their interpretation. The goal, you'll remember,
is to determine whether the two variables assessed are related. The
best way of obtaining a "quick" picture of this possible relation is
to list the scores obtained from each individual in the form of a table.

Researcher A measured IQ and Grade Point Average for 50 students.
For the sake of brevity and convenience, let's assume that he only used
five students in his sample (summarizing the data for 50 will take too
long). Having obtained his data, he might tabulate the scores in the
following manner:

Sample:       IQ      Grade Point Average

Freddie       120     3.50
Gertrude       90     2.00
Horatio       100     2.50
Leona          70     1.00
Percy         110     2.00



Note that even a casual glance at the resultant scores will permit some
interpretation. Specifically, IQ scores and grades are positively correlated.
Individuals with high IQs tend to be those who earn high
grades (Freddie, for example), while individuals with low IQs tend to
be those who earn low grades (Leona, for example). In fact, with only
one exception (Percy), as IQ scores increase, grades tend to increase.
For all we know, Percy, who has a relatively high IQ (110), may be a
person who does not study or spends too much time with extracurricular
activities (very possibly Leona). The important point is that, for the
group as a whole, IQ and grades appear to be related. Put another way,
knowing a person's IQ allows us to make a better prediction about his
grades than would be possible if such information were not available;
similarly, knowing about a person's grades allows us to make a better
prediction about his IQ than would otherwise be possible. Be alert
though! DO NOT use knowledge of correlations to prejudge individuals.
Use correlations to derive understanding about how variables operate
in relation to one another. Do not use them as a basis for categorizing
and stereotyping people before they have been given a fair chance
to perform.
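
To put a number on the "casual glance," here is a short Python sketch applied to Researcher A's five students. It assumes the Pearson product-moment formula, one of several ways of computing a correlation; the function name `pearson_r` is simply a label chosen for this example.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Scores from the table, in order: Freddie, Gertrude, Horatio, Leona, Percy.
iq = [120, 90, 100, 70, 110]
gpa = [3.50, 2.00, 2.50, 1.00, 2.00]

r = pearson_r(iq, gpa)
print(f"r = {r:.2f}")
```

The result falls between 0 and +1, and fairly close to +1, which agrees with the impression of a positive but imperfect relation (thank Percy for the imperfection).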

Let's now turn to Researcher B, who was interested in determining
the relation between weight and tumbling skill of male college students.
For convenience, we'll pretend that his sample consisted of only six
students, although we should recognize that for an actual study, this
number would be much too small (most authors of statistics texts recommend
a sample size of at least 25-30 for correlational analysis). Like
Researcher A, he would first tabulate his results. To illustrate this,
we'll make up some "hypothetical" scores for the 6 contestants:



Students      Weight      Tumbling Score

Woody         112
Wyeth         186
Rollo         227
Abe           136
Grover        250
Bert          150



Looking at the data, what is your impression regarding the relationship
between variables?

Your conclusion was right if it was that a correlation does seem to
be present. Give yourself further credit if you concluded that the
direction of this relationship is negative. Higher weights tend to be
associated with lower scores. Conversely, the lighter guys (e.g.,
Woody) tended to receive the higher scores. Assuming that anyone cared
about this, and that the relationship was reliable (could be replicated),
weight could be used as a meaningful predictor of tumbling skill and vice
versa.

Returning to the example, would weight be a perfectly accurate predictor
of tumbling skill? The data suggest that it would not. Rollo,
for example, at a king-sized 227 lbs., received an absolutely perfect
tumbling score of 10 points! Apparently, there are other factors besides
weight that influence tumbling. (If you want to find out what happened
in Rollo's case, check the inset.)

* * * * * * * * * *

Some Assault! Rollo's Great Roll

The absence of a perfect correlation suggests that one
variable does not completely account for the other.
Clearly, there are numerous factors besides weight
that can influence one's skill at tumbling...

In Rollo's case, it happened to be the variable of
"roundness." What transpired (be glad you weren't there)
is that Rollo, being only 5' tall but 227 lbs., tucked
himself into a ball which allowed him to gain momentum
after each tumble. He took off across the gym like a
gigantic bowling ball, knocking over students and apparatus
as he went. His form was superb, and the results
(36 tumbles and 23 students down) were rather striking, to
say the least. Had a score of "11" been possible,
Rollo would have received it. Besides, if you were
the instructor, would you dare to downgrade this
tumbling terror?

Conclusion: In this isolated case, weight worked
to the individual's advantage. Correlation can be
helpful in making general predictions, but could
lead to a wrong conclusion in any single case.
Weight was shown as accounting for some, but not
all of the differences in tumbling skill.

* * * * * * * * * *

In summary, after you collect your data, it is usually quite
helpful and always appropriate to present it in tabular form. The
table will provide a first impression of what the results show.



CORRELATION ≠ CAUSATION
(necessarily)

Finding a correlation merely implies the existence of a
relationship between the two variables; it does not imply
that one necessarily causes the other. With regard
to Researcher A's study, can you conclude that IQ
causes grades or that grades cause IQ? Such a conclusion
would be unjustified without much more sophisticated
analyses. What usually causes grades, it would seem, is
the number of items people get right on tests. The same
variable (number right) applied specifically to IQ tests
is the direct "cause" of IQ scores. Doing well on an IQ
test is something that should typically go along with
doing well on academic tests used for grades. Thus,
"third factors" such as test-taking skill, problem-solving
ability, reading proficiency, etc., can be the actual things
that tend to produce the association between measured IQ
and obtained grades in a course. By the same reasoning,
it would be incorrect to conclude that weight, as used in
the second example, is necessarily the cause of tumbling
performance. Weight and tumbling could well be associated
with third factors, such as balance, speed, coordination,
etc., whose influences are the primary determinants of the
correlation. In fact, the relationship suggested in the
example could be mostly attributable to the instructors'
being biased in their perceptions of the tumblers' performance.
That is, without being aware of what their biases
are, they might tend to downgrade the heavier people and
upgrade the lighter people, even though the performances
of the two groups might be of equal quality. Here,
the "cause" of tumbling skill would not be weight,
per se, but instructors' biases.
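
The "third factor" idea can be acted out with made-up numbers. The Python sketch below is a simulation under stated assumptions, not an analysis of real data: a hypothetical `skill` score is generated at random, and two other scores are each built from it plus their own independent noise. Neither score is ever computed from the other, yet the two end up correlated. All names, the sample size, the noise levels, and the seed are invented for the illustration.

```python
import random
from math import sqrt

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

rng = random.Random(0)   # fixed seed so the run can be repeated
n = 1000

# Hypothetical third factor (think of it as general test-taking skill).
skill = [rng.gauss(0, 1) for _ in range(n)]

# Each observed score = third factor + its own independent noise.
iq_like = [s + rng.gauss(0, 0.5) for s in skill]
grade_like = [s + rng.gauss(0, 0.5) for s in skill]

r_observed = pearson_r(iq_like, grade_like)

# Take the third factor back out of both scores and correlate what is left.
r_leftover = pearson_r(
    [a - s for a, s in zip(iq_like, skill)],
    [b - s for b, s in zip(grade_like, skill)],
)

print(f"correlation of the two scores:        {r_observed:.2f}")
print(f"correlation once skill is taken out:  {r_leftover:.2f}")
```

The first correlation is sizable; the second hovers near zero. In this toy world the whole association is produced by the third factor, which is the possibility the passage above warns about.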



Step #5: Construct a Scatterplot of the Data

Since constructing and interpreting scatterplots will be the subject
of the next instructional unit, the present discussion will be
brief. First, imagine that Researcher A actually used 1000 rather
than 50 students in his IQ-grades study. He would still want to
tabulate the results. After doing so, he would end up with 1000 pairs
of numbers. Getting a "first impression" would be rather difficult,
especially if he wants to maintain his sanity and his eyesight. In
"real life" he would probably look toward the computer to provide the
actual correlation, without becoming too concerned with a first impression.
But let's suppose there is no computer and he does not
know how to calculate a correlation (which you will learn to do in
Unit 5.3). In this instance, he would probably find it very helpful
and informative to construct what is called a "scatterplot."

A scatterplot is simply a graphical representation of the scores,
with each point on the graph representing the scores obtained for an
individual unit (person) on both variables. The graph consists of an
X axis (horizontal), showing the range of possible scores on one variable,
and a Y axis (vertical), showing the range of possible scores on
the other variable. When used together, these axes indicate appropriate
positions for representing the scores obtained by each unit
(person) in the sample. We represent these scores by placing a point
for each person at the point of intersection of straight lines drawn
through his score on Variable X perpendicular to the X axis and through
his score on Variable Y perpendicular to the Y axis. Sound confusing?
It's really quite simple. Look at the following scatterplot for the
five people measured in the IQ-grades study (Study A).



[Scatterplot of Study A: Grade Point Average on the Y axis (1.00 to 4.00) against IQ on the X axis (70 to 120), with one labeled dot each for Freddie, Horatio, Gertrude, Percy, and Leona.]



Note that, from the scatterplot, we can easily determine how each
individual in the sample scored on the two variables. The dot used to
represent Freddie's scores lies on the line perpendicular to the Y axis
(Grades) at the score point 3.50 and on the line perpendicular to the
X axis (IQ) at the score point 120. Since the sample included five units,
the scatterplot will consist of five dots.
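
If no graph paper is handy, the same five dots can be laid out as a crude text scatterplot. The Python sketch below is only an illustration: the function `ascii_scatter` is a made-up helper, and it assumes that every score falls exactly on one of the tick values chosen for the axes (true for these five students, but not for data in general).

```python
def ascii_scatter(points, x_ticks, y_ticks):
    """Return a text scatterplot; each point must sit exactly on tick values."""
    lines = []
    for y in reversed(y_ticks):          # highest Y value goes on the top row
        row = " ".join("*" if (x, y) in points else "." for x in x_ticks)
        lines.append(f"{y:4.2f} | {row}")
    return "\n".join(lines)

# (IQ, Grade Point Average) pairs for Freddie, Gertrude, Horatio, Leona, Percy.
points = {(120, 3.5), (90, 2.0), (100, 2.5), (70, 1.0), (110, 2.0)}

x_ticks = [70, 80, 90, 100, 110, 120]         # IQ, left to right
y_ticks = [1.0 + 0.5 * i for i in range(7)]   # grades, 1.00 up to 4.00

plot = ascii_scatter(points, x_ticks, y_ticks)
print(plot)
```

Each row is one grade level and each column one IQ level, so Freddie's star lands at the far right of the 3.50 row and Leona's at the far left of the 1.00 row.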




Suppose the researcher included every student in the class in
his study. The resultant scatterplot might look something like this:



[Scatterplot: Grade Point Average on the Y axis (up to 4.00) against IQ on the X axis (70 to 130), with one dot for every student in the class.]



Once again, every dot on the graph represents two scores for the individual
units that were measured. How many people were included in the
sample? (If you feel inclined to count the dots you will find that 35
students were included.)

By looking at the scatterplot, anyone who is familiar with correlations
would be able to characterize the relationship easily. In this
case, grades and IQ appear to show an imperfect, positive relation for
this particular sample. By the time you finish reading the next instructional
unit, you will be able to do this too (isn't statistics exciting?).

By the way ...

... a scatterplot of the "weight-tumbling" data (Study B) would look
like:



[Scatterplot of Study B: tumbling score on the Y axis (up to 10) against WEIGHT on the X axis (100 to 250).]

Which "dot" is Rollo?



5.2 POSTTEST (answers in back)

1. A teacher is interested in whether students' smoking habits relate
to their attitudes toward school. Briefly describe a set of procedures
through which she could investigate this question. (There
is no one correct answer; numerous approaches are possible, depending
upon how one interprets her interests.) What would be appropriate
units of analysis? What variables would need to be assessed
and how would the assessment be structured?



2. A basketball coach is interested in determining whether height
relates to point production for starting players on high school
basketball teams. He measures a sample of five players on both
variables and obtains the following results:

Name:        Height (in inches)     Points (average per game)

Rimm         83                     28
Hook         80                     20
Duncker      70                      5
Block        76                     14
Walker       65                      7

Given these data, would you conclude that there is a relation
present? If so, what type?

3. An animal researcher believes that the amount of aggression displayed
by dogs (biting people, barking, etc.) is significantly
related to the amount of punishment they receive. In testing
this hypothesis, what would constitute the unit of analysis (that
is, of what would the sample to be tested be comprised)?

4. (BONUS) A medical researcher believes that the frequency of daily
fainting spells is positively related to the average daily temperature.
What would be the appropriate units of analysis in
conducting research on this question?

Suppose the researcher's interest shifted to the question of
whether people's tendency to faint on a given day of the year
(Oct. 29) is positively related to their body temperature. What
would the units of analysis be here?

5.3 Instructional Unit: Construction and Interpretation of Scatterplots

As suggested at the conclusion of the preceding unit, scatterplots
provide a helpful and often highly interpretable visual representation
of correlational data. By examining the pattern formed by the various
points, one can obtain an immediate impression of the strength and
direction of the relation between the two variables of interest.

Construction

Note: Constructing a scatterplot is very simple, and it is quite possible
that you picked up the rules from the previous section...or from
another course. Take a quick look at the explanation that follows. If
you feel that you already know it, save time by either skimming through
or skipping it completely and branching to the section on interpretation.

Explanation

1. Draw a horizontal number line beginning with zero to be used as
the X axis. Similarly, draw a vertical number line up from the
origin of the X axis. Call the vertical axis the Y axis. Note
that you are simply constructing the first quadrant of a typical
coordinate system in two dimensions.

2. On each axis, place appropriate numeric labels that will permit
you to account for the full range of variation for the
particular variables being represented. For example, if your Y
variable was height (the possible scores for which could range
from 60 in. to 80 in.), you might divide your Y axis into equally
spaced intervals of five units, starting at 60 and ending at 80.
If your X variable was weight, with the range of scores being
from 100 to 200, you might divide your X axis into equally spaced
intervals of 20 units, starting at 100 and ending at 200. The
intervals you finally select should depend upon how much information
you want to provide in representing the data. For this
particular illustration, an example might be:

[Graph: HEIGHT on the Y axis, marked from 60 to 80 in steps of 5; WEIGHT on the X axis, marked from 100 to 200 in steps of 20. Both axes carry slash marks near the origin.]

Note that the slash marks are included to show that parts of
the X & Y axes are not present in the graph. Without this
convention, it would be easy to distort relationships.



3. Once your axes are constructed, plot the individual pairs of scores
by representing them in terms of "dots" or "points" appropriately
positioned with respect to the X and Y axes. Each point represents
a person (or thing) and how he or she performed on the X and Y variables.
How to plot these points is easier to demonstrate than to
describe, but one way would be to: a) start with the first person
and look at his/her score on variable X; b) move along the X axis
until you find the appropriate position for that score; c) keeping
your pencil at that position, find the same person's score on variable
Y; d) then move your pencil vertically upwards until it reaches
the Y axis position that corresponds with the just determined Y
score; e) place your "dot" where your pencil rests; and f) repeat
these procedures for the remaining elements in your sample.

For example, let's suppose that the first person in the height/weight
study received scores of 180 lbs. and 75 inches. The dot representing
these scores would be positioned as follows: (Note that it
corresponds to 180 units on the X axis, and 75 units on the Y axis.)



[Scatterplot: HEIGHT on the Y axis (60 to 80) against WEIGHT on the X axis (100 to 200), with a single dot at weight 180, height 75.]
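
The labeling and plotting steps above can be sketched in Python as well. This is only a rough illustration: `axis_ticks` is a made-up helper, and finding the dot's position by looking up its scores among the labels works here only because 180 and 75 happen to fall exactly on labeled marks.

```python
def axis_ticks(low, high, step):
    """Equally spaced axis labels from low to high, both ends included."""
    return list(range(low, high + 1, step))

height_ticks = axis_ticks(60, 80, 5)      # Y axis: 60, 65, ..., 80
weight_ticks = axis_ticks(100, 200, 20)   # X axis: 100, 120, ..., 200

# First person in the height/weight study: 180 lbs. and 75 inches.
weight, height = 180, 75

# Count how many labeled marks to move along each axis to reach the dot.
marks_right = weight_ticks.index(weight)
marks_up = height_ticks.index(height)

print("X axis labels:", weight_ticks)
print("Y axis labels:", height_ticks)
print(f"dot sits {marks_right} marks right and {marks_up} marks up from the first labels")
```

Choosing a smaller step gives more labels and a more detailed picture; a larger step gives fewer labels and a coarser one.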



With ten people in the sample, the scatterplot of the height/weight
study might look something like the one shown on the next page. For practice,
can you determine the height and weight scores of the person represented
by dot "A"?



[Scatterplot of the height/weight study for ten people: HEIGHT on the Y axis against WEIGHT on the X axis (100 to 200), with one dot labeled "A".]

You should be able to determine that person A weighs 130 lbs. and
is 75 inches tall (it's probably easier to find him on the graph than
it is in real life).

End of Explanation



Interpretation of Scatterplots

By examining a scatterplot, it is usually possible to obtain a
fairly accurate first impression of the direction and strength of the
obtained relation between variables.

Direction: You will recall that the relation between two variables can
be either positive or negative. A positive relation occurs when high
scores on Variable X tend to be associated with high scores on Variable
Y, and low scores on X tend to be associated with low scores on Y. For
negative relations, the converse is true: high scores on X tend to correspond
with low scores on Y; low scores on X tend to correspond with
high scores on Y. This classification can be summarized as follows:



Positive Correlation (+)          Negative Correlation (-)
X scores     Y scores             X scores     Y scores

high         high                 high         low
low          low                  low          high

Rule:

If two variables are positively correlated, the dots represented
on a scatterplot will tend to slope upwards (from the bottom
lefthand corner to the top righthand corner).

If two variables are negatively correlated, the dots represented
on a scatterplot will tend to slope downwards (from the top
lefthand corner to the bottom righthand corner).






Turn back one page and look at the scatterplot for the height/weight
study. What type of relation does this appear to
represent?

Answer: A positive relation because there is a clear tendency for
the points to slope upwards.

Without becoming too concerned about actual interpretation, determine
what types of relations appear to exist for the following three
pairs of variables, as suggested by the scatterplots:



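[Three scatterplots: variable A (Y axis) against variable B (X axis, 250 to 400); variable C (Y axis) against variable D (X axis, 65 to 90); variable E (Y axis) against variable F (X axis, 1000 to 3000).]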




Judging from the scatterplots, variables A and B seem to be negatively
related; C and D seem mysterious; and E and F appear to be positively
related. But, wait, what about this C and D correlation? The points do
not appear to have any slope whatsoever! The only conclusion that one
can reach is that C and D are not correlated; there is little or no
association between the values obtained on each.

Do you understand why particular slopes follow from negative and
positive correlations? When there is a direct association between scores,
i.e., a positive correlation, X scores near the origin (0, 0 point) will
be associated with Y scores near the origin; X scores far from the origin
will likewise be associated with Y scores far from the origin. The result
is an upward slope. For a negative correlation, the reverse occurs;
the result is a downward slope. If you're having trouble picturing this,
you may want to read the additional explanation provided on the following
page.







Additional Explanation: Ups and Downs on a Scatterplot 

Why, for example, does a positive correlation produce an upward
slope in the scatterplot points? The answer lies in the definition
of positive correlation: High on X, High on Y; Low on X, Low on Y.
Thus, many of the dots to be placed on the scatterplot will represent
people whose X and Y scores are related in one of these two ways.
Putting this another way, as the dots are moved farther to the right
on the X axis (indicating higher X scores), they are also moved farther
upwards on the Y axis (indicating higher Y scores), and vice versa.
The result is the general pattern of movement from lower left to upper
right on the scatterplot.

Let's see how this happens. Suppose every tree in a forest is
tagged according to the year it was planted (that is, its age). We may
then wish to examine the relation between age (call this X) and the
number of annual rings (call this Y). Intuitively speaking, this should
produce a positive correlation since trees with low ages would have few
annual rings, whereas those with high ages should have many annual rings.
In fact, there should be a perfect correspondence, as reflected below.



[Figure: scatterplot of number of annual rings (Y, 10 to 40) against AGE (X, 10 to 40); the points fall along a straight line sloping upward.]



A somewhat different pattern would emerge, however, if we plotted
data that are negatively correlated. Take, for instance, a correlational
study designed to determine the relation between number of strokes and
money earned for all golfers participating in a local tournament.
Hypothetical data from six participants could be as follows:



Golfer              Strokes    Winnings
Potter                 80         300
Green                  85         200
Wood                   70         400
Parr                   65         500
[name illegible]       90         100
Ruff                  100          50



Here we get the opposite (a downward slope) since the data are
negatively correlated (high-low; low-high). The pattern is shown
below. Note, for example, that the one X score nearest the origin is
associated with a Y score far from the origin. See how the downward
slope develops?



[Figure: scatterplot of strokes (Y, 65 to 100) against WINNINGS (X, 100 to 500); the points slope downward from upper left to lower right.]

In summary, when the points on a scatterplot slope upwards, the
trend is indicative of a positive correlation; a downward slope is indicative
of a negative correlation; no slope is suggestive of a zero correlation.
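The direction of a relation can also be checked numerically. The short Python sketch below is not part of the original text. It assumes a simple rule of thumb: sum the products of each score's deviation from its own mean; a positive total goes with an upward slope and a negative total with a downward slope. The function name `direction` is hypothetical, the golf figures are the hypothetical data given above, and the tree values are made up for illustration.

```python
def direction(xs, ys):
    """Classify a relation from the sign of the summed deviation
    cross-products (an illustrative rule, not the book's own method)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    total = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if total > 0:
        return "positive"
    if total < 0:
        return "negative"
    return "none"


# Golf data: winnings (X) and strokes (Y) for the six participants.
winnings = [300, 200, 400, 500, 100, 50]
strokes = [80, 85, 70, 65, 90, 100]

# Tree data: assumed ages with exactly matching ring counts.
ages = [10, 20, 30, 40]
rings = [10, 20, 30, 40]

print("golf:", direction(winnings, strokes))
print("trees:", direction(ages, rings))
```

The sign alone says nothing about strength; that question is taken up in the sections that follow.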



Strength of Relation 


Examination of a scatterplot also provides an indication of the
strength of correlations. Keep in mind that correlations do differ in
terms of strength; for example, hip width and shoe size would both be
correlated with weight. In most instances, however, we would be better
off basing our judgments about weight on hip width than on shoe size.

How do you determine the strength of a correlation? As will be
discussed in the following instructional unit, correlations can be assessed
through very exact, precise means. But if all one desires is a general
overall impression, a quick glance at a scatterplot will usually suffice.

Rule: In examining a scatterplot, the more the points approximate a
straight line, the stronger the relation between variables. Whether the
slope is positive or negative is not relevant.

Given this simple rule, you are now in a position to evaluate the
strength of correlations simply by looking at the corresponding
scatterplots. Which of the three variables, A, B, or C, appears to correlate
most strongly with IQ (turn page for graphs)?



[Figure: three scatterplots relating IQ to Variables A, B, and C.]



To feel completely accomplished and highly learned, you would have
to answer C. Why? The strength of a correlation is independent of its
direction. Thus, it is certain that the strongest relation is that
between IQ and Variable C, since the points more closely approximate a
straight line relative to the patterns shown for the other two variables.
Of Variables A and B, the latter seems more closely related to IQ for the
same reason. The best predictor of IQ is probably Variable C, the
second best is probably Variable B, while the least effective is probably
Variable A.

In summary, looking at the slope of the points in a scatterplot
provides insight into the direction of a correlation; that is, whether it is
positive or negative. Determining the degree to which the points in a
scatterplot fall into a straight line pattern provides insight into the
strength of the correlation; that is, how closely the two sets of scores
are associated.

5.3 POSTTEST (answers in back)

1. A researcher believes that the amount of money people earn (annual
salary) is significantly related to their educational background
(number of years of schooling). To test this hypothesis, he decides
to correlate annual salary with years of schooling for a sample of
30-year-old males who reside in his home town. Although he probably
should employ at least 25 subjects in his sample, for purposes of
convenience we will assume that he only tests six subjects.

The data he obtains are as follows:



Subject    Years of Education    Salary (in units of $1000)
#1         [illegible]            8
#2         [illegible]           17
#3         16                     6
#4         18                    16
#5         18                    20
#6         14                     9



a. Construct a scatterplot of these data (treat salary as the
Y variable).

b. Does there appear to be a relation? If so, what type?

c. Suppose an additional subject, whose Education and Salary
scores were 20 and 5 respectively, were to be included in the
sample. Would this tend to strengthen or weaken the obtained
relationship? Why?




2. Which of the following three scatterplots represents the strongest
relation between variables?

[Figure: three scatterplots labeled PLOT A, PLOT B, and PLOT C.]

Briefly characterize the above scatterplots in terms of the
direction and relative strength of the relations they depict. Do the
same for the two scatterplots presented below.

[Figure: two scatterplots labeled PLOT D and PLOT E.]

Note: Do not become concerned if your descriptions differ slightly
from those used in the answer section. Determining whether a
correlation is moderately strong, very strong, etc. involves a
lot of subjectivity.






5.4 Instructional Unit: The Correlation Coefficient

Assuming you have understood the preceding sections, you should now
be capable of doing many exciting things pertaining to the concept of
correlation: you know what correlation implies, what types of relations
are possible, and how to assess relations merely by examining the
patterns represented on scatterplots. Given all these skills, perhaps you
will be able to answer the following question: Of the two relationships
shown below, which is probably stronger? Note that in one instance we are
correlating Grades with IQ and in the other we are correlating Grades
with Number of Class Absences.



[Figure: two scatterplots of Grades, one against IQ and one against ABSENCES.]



Perhaps you are inclined to pick IQ as the better predictor of (the
more closely related to) grades because the relationship appears to be
positive. This rationale would be incorrect since the strength of a
relationship is independent of its direction. Then, perhaps you are
inclined to pick absences as the more closely related since the
scatterplot "appears" to resemble a straight line pattern more closely than is
the case for IQ. Here, the rationale would be correct, but can you
really be sure about your decision?

The conclusion to be reached is that scatterplots are useful from
the standpoint of providing a general impression of what the data probably
imply, but they cannot be used for making precise, certain judgments.
Recognizing this, statisticians have derived what is called the
correlation coefficient, which, when computed, defines the direction and strength
of a relation in clear, precise terms. More specifically, it is also
often called the Pearson product-moment correlation (since Pearson
developed the formula). The correlation coefficient is symbolized as
"r" when the data are obtained from samples, and "ρ" when obtained from
populations.

Definition: The correlation coefficient, r, is a statistical index used
to indicate the precise direction and strength of the relation being
examined. It is expressed in the form of a numerical value which can
range from +1.00 to -1.00.






The sign of the coefficient (+ or -) indicates the direction
of the relationship, positive or negative.

The absolute value of the numerical component indicates the
strength of the relationship. By absolute value is meant the value
of the number without the sign, i.e., its distance from zero.

Example Problem: On the basis of the above definition, answer the
following questions in regard to this series of r's:

-.26; +.09; -.78; -.47; +.51; -1.26; +.01

a) Which of the above coefficients could not possibly be obtained?

b) Which indicates the strongest positive relation?

c) The strongest negative relation?

d) The strongest relation overall?

e) Which, if represented on a scatterplot, would result in a
pattern that most closely approximates a straight line?

Answers:

a) r = -1.26 could not be obtained; it is beyond the possible
range of -1.00 to +1.00. Somebody goofed in deriving it.

b) The strongest positive relationship is +.51; it has the highest
absolute value among the positive coefficients.

c) The strongest negative relationship is r = -.78; it has the
highest absolute value among the obtainable negative coefficients.

d) The strongest relationship overall is -.78; it has the highest
absolute value of any of the obtainable coefficients listed.

e) The coefficient that would most closely approximate a straight
line pattern is, again, r = -.78; it indicates the strongest
relationship (see Unit 5.3 for discussion of scatterplots).
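The reasoning in these answers can be spelled out step by step. The Python sketch below is an illustration added here, not the book's own procedure: it rejects any coefficient outside the range -1.00 to +1.00, then uses the sign for direction and the absolute value for strength. The helper name `is_possible` is hypothetical.

```python
def is_possible(r):
    """A correlation coefficient must lie between -1.00 and +1.00."""
    return -1.0 <= r <= 1.0


coefficients = [-0.26, 0.09, -0.78, -0.47, 0.51, -1.26, 0.01]

# a) Values that could not possibly be obtained.
impossible = [r for r in coefficients if not is_possible(r)]
valid = [r for r in coefficients if is_possible(r)]

# b) and c) Direction comes from the sign, strength from the absolute value.
strongest_positive = max(r for r in valid if r > 0)
strongest_negative = min(r for r in valid if r < 0)

# d) and e) The strongest relation overall ignores the sign entirely.
strongest_overall = max(valid, key=abs)

print("impossible:", impossible)
print("strongest positive:", strongest_positive)
print("strongest negative:", strongest_negative)
print("strongest overall:", strongest_overall)
```

Note that the impossible value is dropped before any comparison is made; otherwise -1.26 would wrongly win on absolute value.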

A coefficient of +1.00 indicates a perfect positive correlation,
which, if represented on a scatterplot, would result in all points falling
in a straight line pattern with an upward slope. A necessary (although
not sufficient) condition for a perfect positive correlation is that the
person who scores highest on Variable 1 scores highest on Variable 2;
second highest on Variable 1, second highest on Variable 2; and so on.
Examples would be the relationship between annual rings and the age of
trees, and the relation between height in inches and height in
centimeters for all students in a class.

A coefficient of -1.00 indicates a perfect negative correlation, as
would be represented by a downward straight line pattern on a scatterplot.
A necessary (although not sufficient) condition for this type of relation
is that the person who is highest on one variable is lowest on the other;
the person who is second highest on one is second lowest on the other; and
so on. An example would be the relation between multiple-choice test
scores and the number of incorrect answers on the same test (think about
it). Whenever you obtain either a perfect positive or perfect negative
correlation, you are able to predict one variable from the other without
error; that is, predictions will be perfect.

A coefficient of r = 0.00 indicates no relation between variables,
as would be represented by a scatterplot which has no slope whatsoever.
This occurs when there is no systematic correspondence between scores on
the two variables; people who score high on one variable obtain all
different score values on the other. Examples would be the relation between
IQ and freckles; height and grades in Freshman English; length of name
and visual acuity. A zero correlation indicates that knowing a person's
score on one variable provides absolutely no basis for predicting his
score on the other.



A coefficient greater than r = 0 and less than r = +1.00 (e.g.,
+.72, +.07, +.91) indicates an imperfect positive correlation, as would
be represented by a scatterplot in which the points slope upward, but do
not fall in a straight line. Examples would be the relation between
height and weight; [illegible] and grades; etc.

A coefficient less than r = 0 but greater than r = -1.00 (e.g., -.68,
-.02, -.95) indicates an imperfect negative correlation, as would be
represented by a scatterplot in which the points slope downward, but do
not fall in a straight line. Examples would be the relation between
number of absences and grades; weight and ability to fit through small doors;
darkness of skin and susceptibility to sunburn (note that very light people
tend to burn more easily); etc.



Some Examples:

[Figure: example scatterplots labeled r = 0.00, r = .80, and r = .90 (sign illegible).]



Final Note:

Correlation coefficients allow one to make an accurate and precise
determination of the degree of relation between the two variables of
interest. The sign of the coefficient indicates the direction, + or -;
the absolute value of the number indicates the strength. Coefficients
of ±1.00 represent a perfect correspondence between scores. By knowing
the correlation coefficient, one can infer how accurately one variable
can be predicted from the other.



Example:

Suppose that the correlation between high school grade point average
(GPA) and college Freshman GPA is +.47. The correlation between ACT
scores and Freshman grades is +.61. Which provides the better predictions
of Freshman grades, high school grades or ACT scores? Apparently, it is
the latter. Will predictions be perfect? No, but they will be more
accurate than they would be without the ACT information.

5.4 POSTTEST (answers in back)



1. Briefly describe what is meant by the correlation coefficient.
What values can it take? How are those values interpreted?

2. Match the following coefficients to the scatterplots shown below.

a. -1.00
b. -.50
c. 0.00
d. +.50
e. +1.00

[Figure: scatterplots to be matched with the coefficients above.]

3. A researcher finds the correlation between amount of dating and
grades to be r = -.20. He also finds that the correlation between
number of absences and grades is r = -.50.






a. How would you characterize the two relations that were obtained
(positive, negative, perfect, imperfect, etc.)?

b. Which is a better predictor of grades, dating or absences? How
do you know?

c. On the basis of the results, how would you characterize the
general grade pattern (high or low) for students who date frequently?

John is a heavy dater, not because he weighs 190 lbs., but because
he generally has about 50 dates a week, more than one of which he
sometimes absent-mindedly schedules for the same time. Based on the
correlation, is it necessarily true that John's grades are among the
lowest in his class?

d. Which of the two scatterplots would probably represent the
dating-grades relation (as opposed to the absences-grades)? Why?




4. A researcher finds the correlation between years of education and
annual income to be r = .26 for a random sample of males over 30
years of age, living in different areas of the country. In a
separate study, he finds the correlation between distance of residence
from a large city and annual income to be r = -.39 for the same
sample.

a. What types of relations were obtained?

b. Which is likely to be the better predictor of income, years of
education or distance from a large city? Why?






c. On the basis of the obtained relations, would income tend
to be high or low for people with many years of education? Would
income tend to be high or low for people who live far from a large
city? Could you be certain that these tendencies would apply to
all people? Why or why not?

d. If both relations were represented on scatterplots, which would
more closely approximate a straight line pattern?












5.5 Instructional Unit: Computation of r

We have now reached the last step in our treatment of correlation;
you know how to interpret correlation coefficients, but the question of
how to compute them remains. Two formulas will be presented, one being
more valuable from a conceptual than a computational standpoint, and the
other the reverse. A third (special) formula will close the unit.

The z score formula

This formula involves computing the correlation coefficient by
working with z scores obtained on each variable. Remember z scores
(Unit IV)? Each raw score can be converted to a z score by the formula:

$$z = \frac{X - \bar{X}}{s}$$

where $\bar{X}$ is the mean and $s$ is the standard deviation of the scores.

You should recall that z scores indicate how many standard deviations
above or below the mean a certain raw score lies. Thus, a person whose
raw score on a spelling test translates into z = +3.00 has performed
quite well; one with a z score of -1.98, for example, has performed
relatively poorly.
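As a refresher, the conversion from raw scores to z scores can be sketched in a few lines of Python. This block is an added illustration, not taken from the book. It assumes the standard deviation is computed by dividing by n (the number of scores), which is the convention that makes the z score correlation formula in this unit work out exactly; Unit IV may define it differently. The function name `z_scores` and the spelling scores are hypothetical.

```python
import math


def z_scores(scores):
    """Convert raw scores to z scores: (score - mean) / standard deviation.

    Assumption: the standard deviation divides by n, not n - 1.
    """
    n = len(scores)
    mean = sum(scores) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in scores) / n)
    if sd == 0:
        raise ValueError("all scores are identical; z scores are undefined")
    return [(x - mean) / sd for x in scores]


# Hypothetical spelling test scores (mean 5, standard deviation 2).
spelling = [2, 4, 4, 4, 5, 5, 7, 9]

for raw, z in zip(spelling, z_scores(spelling)):
    print(f"raw score {raw}: z = {z:+.2f}")
```

A useful check on any set of z scores is that they sum to zero, since the deviations above and below the mean cancel.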

The z score formula for computing correlations is fairly easy to
understand conceptually. It is cumbersome to use, however, unless the
raw scores on each variable are already in z score form.

The formula is as follows:

$$r = \frac{\sum z_X z_Y}{n}$$

What it means is that to compute the correlation coefficient, you
must: (1) multiply each person's z score on variable X by his/her z
score on variable Y; (2) obtain the sum of these crossproducts;
and (3) divide the sum by n. The result will be r. Easy to use if the
z scores are provided; cumbersome to use if only the raw scores are
provided. (n = number of observations; i.e., people who were tested.)

Example

A teacher is interested in determining the relation between Math
Achievement and Spelling Achievement for her class of 5th graders. She
tests all her students (n = 7) on both variables. The tests are graded
by computer, and the results she obtains include the z score for each
raw score value. By having these z scores, she knows that a correlation
coefficient can easily be computed by use of the z score formula. The
results are as follows:

Child    z score on X (Math)    z score on Y (Spelling)    $z_X z_Y$
A             1.40                    1.00                   1.40
B             1.00                    1.40                   1.40
C              .70                     .70                    .49
D              .00                     .00                    .00
E            - .60                   -1.20                    .72
F            -1.30                   - .60                    .78
G            -1.20                   -1.30                   1.56

                                          $\sum z_X z_Y = 6.35$

$$r = \frac{\sum z_X z_Y}{n} = \frac{6.35}{7} = .91$$



Note that the availability of z scores makes the computation of r
amazingly simple, although the result, r = .91 (though accurate), might
seem a little far-fetched. Do you think you'd obtain such a correlation
between math and spelling achievement many times if you checked many
classes?
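The three steps of the z score formula translate almost word for word into code. The Python sketch below is an added illustration; the function name `r_from_z` is hypothetical. The math z score for the seventh child is taken as -1.20, the value implied by the printed crossproduct of 1.56 and by the fact that z scores sum to zero.

```python
def r_from_z(zx, zy):
    """z score formula: multiply paired z scores, sum the products, divide by n."""
    if len(zx) != len(zy):
        raise ValueError("each person needs a z score on both variables")
    crossproducts = [a * b for a, b in zip(zx, zy)]  # step 1
    total = sum(crossproducts)                       # step 2
    return total / len(zx)                           # step 3


# z scores for the seven 5th graders in the example above.
math_z = [1.40, 1.00, 0.70, 0.00, -0.60, -1.30, -1.20]
spelling_z = [1.00, 1.40, 0.70, 0.00, -1.20, -0.60, -1.30]

r = r_from_z(math_z, spelling_z)
print(f"r = {r:.2f}")
```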

If you think about it, it should be easy to determine (in a general
sense) why the z score formula works. If positive z scores are
associated with positive z scores, and negatives with negatives, we should
end up with a positive correlation, right? Why does this happen with the
formula? The answer should be understandable if you think about it: a
positive z multiplied by a positive z yields a positive $z_X z_Y$; similarly,
a negative z multiplied by a negative z yields a positive $z_X z_Y$. The
result is all positive numbers in the righthand column, which, when
added together, will yield a positive $\sum z_X z_Y$ and thus a positive
correlation. Watch how this happens in the sample data shown below, in which
performances on Variable X correspond perfectly with performances on
Variable Y (in terms of magnitude).

Example

Child    $z_X$     $z_Y$     $z_X z_Y$
A        +1.50     +1.50     +2.25
B        + .50     + .50     + .25
C          .00       .00       .00
D        - .50     - .50     + .25
E        -1.50     -1.50     +2.25

                   $\sum z_X z_Y = 5.00$

$$r = \frac{\sum z_X z_Y}{n} = \frac{5}{5} = +1.00$$

When performances on both variables are perfectly matched, we obtain
a correlation coefficient of +1.00. What would happen if high scores
tended to be paired with high scores (and lows with lows), but the
matching was less than perfect? We would expect an imperfect positive
correlation: high z scores would be multiplied by "relatively" high z scores,
but the resultant products would have somewhat lower values than in the
previous example. Similarly, low (negative) z scores would be
multiplied by "relatively" low z scores, yielding positive products that are
slightly lower in value than those obtained when the correspondence is
perfect. Let's see how this occurs.



Example

Student    $z_X$     $z_Y$     $z_X z_Y$
A          +1.50     + .50     + .75
B          + .50     +1.50     + .75
C            .00       .00       .00
D          - .50     -1.50     + .75
E          -1.50     - .50     + .75

                     $\sum z_X z_Y = 3.00$

$$r = \frac{\sum z_X z_Y}{n} = \frac{3.00}{5} = +.60$$



Thus, a fairly strong but imperfect correspondence between scores on
both variables yielded a moderately high positive correlation. Note that
the $z_X$ and $z_Y$ scores above are the same as in the preceding example, but
the pairings have been changed.

Why do we obtain a zero correlation coefficient when there is
absolutely no association between variables? In terms of the z score formula,
we would be multiplying positive values by negative (or zero) values in
some cases, and by positive values in other cases. The result would be
some positive $z_X z_Y$'s and some negative $z_X z_Y$'s which, when summed together,
would mostly cancel each other out. Using the same z scores as above,
but different pairings, let's see what occurs when performances on X and Y
show almost no relation to one another.



Example

Person    $z_X$     $z_Y$     $z_X z_Y$
A         +1.50     - .50     - .75
B         + .50     + .50     + .25
C           .00     +1.50       .00
D         - .50     -1.50     + .75
E         -1.50       .00       .00

                    $\sum z_X z_Y = .25$

$$r = \frac{\sum z_X z_Y}{n} = \frac{.25}{5} = +.05 \text{ (very low)}$$



Finally, why do we obtain a negative correlation when the
relationship between variables is inverse (i.e., high with low; low with high)?
In terms of the z score formula, we would be multiplying high (positive)
scores by low (negative) scores and vice versa. The result would be all
(or mostly all) negative values in the $z_X z_Y$ column which, when summed,
produce a negative value. Watch how this happens in the following example
in which the obtained relation shows a perfect inverse correspondence
between scores.



Example

Subject    $z_X$     $z_Y$     $z_X z_Y$
A          +1.50     -1.50     -2.25
B          + .50     - .50     - .25
C            .00       .00       .00
D          - .50     + .50     - .25
E          -1.50     +1.50     -2.25

                     $\sum z_X z_Y = -5.00$

$$r = \frac{\sum z_X z_Y}{n} = \frac{-5.00}{5} = -1.00 \text{ (perfect negative)}$$

The above examples are intended merely to illustrate how the z score
formula works. If you understand it conceptually, reconstructing the
formula for actual use should present little problem. If you do not
understand it conceptually, there is no need to become concerned; such
understanding may come in time. The question for now is: if given the
formula and pairs of z scores, could you use it to compute the correlation
coefficient? If you answer "yes," feel free to continue further; an
answer of "no" implies the need to review the five example problems presented
above.
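One way to review the last four examples together is to hold the X z scores fixed and change only how the Y z scores are paired with them. The Python sketch below is an added illustration (the names `r_from_z` and `pairings` are hypothetical); it applies the z score formula to each pairing in turn.

```python
def r_from_z(zx, zy):
    """z score formula: sum of the paired crossproducts divided by n."""
    return sum(a * b for a, b in zip(zx, zy)) / len(zx)


# The same five z scores on X are used in every example.
zx = [1.5, 0.5, 0.0, -0.5, -1.5]

# Only the pairing of Y z scores with X z scores changes.
pairings = {
    "perfect positive": [1.5, 0.5, 0.0, -0.5, -1.5],
    "imperfect positive": [0.5, 1.5, 0.0, -1.5, -0.5],
    "almost no relation": [-0.5, 0.5, 1.5, -1.5, 0.0],
    "perfect negative": [-1.5, -0.5, 0.0, 0.5, 1.5],
}

results = {label: r_from_z(zx, zy) for label, zy in pairings.items()}

for label, r in results.items():
    print(f"{label}: r = {r:+.2f}")
```

Because the scores themselves never change, any difference in r comes entirely from which X score is matched with which Y score.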

In conclusion, the z score formula constitutes a fairly simple means
of computing r, provided that all raw scores have been converted into
z scores. If such is not the case, a more appropriate formula to employ
is the "raw score" formula to be described below.






Raw Score Formula for Computing r

At first glance the raw score formula looks formidable and
threatening; in fact, it has the potential to throw apprehensive students
into shock. The effects, however, are reversible; just make sure oxygen
is near before you read further. Seriously though, the raw score formula
is merely a derivation of the z score formula just explained; thus, for
any set of data it will produce the exact same result. Unlike the z score
formula, it is designed for use with distributions that contain only the
raw scores, i.e., actual scores. When you examine the raw score formula
below, your first reaction may be: "None of this for me, I'm going to
use the z score formula for computing all correlations!" Easy to say,
but very difficult to do if you are not given z scores. With a hand
calculator, using the raw score formula should present very little
problem, provided that you carefully derive all the values that must be
entered into the equation. Good work habits are very important.

The formula is as follows:

$$r = \frac{n\sum XY - \sum X \sum Y}{\sqrt{n\sum X^2 - (\sum X)^2}\;\sqrt{n\sum Y^2 - (\sum Y)^2}}$$



Collect your senses, and try to overcome any inclination to
physically abuse this book. Ask yourself what types of values are
needed for entry into the formula. If you do this, perhaps you'll
realize that the requirements are really quite simple, although the
mathematics do leave room for error. To work the formula, all you need
to compute are:

$\sum X$: simple to do; add up all the scores on the X variable
$\sum Y$: just as simple; add up all Y scores
$\sum X^2$: add up the squares of all X scores
$\sum Y^2$: add up the squares of all Y scores
$\sum XY$: a little more complicated, but not much; multiply each
X score by its corresponding Y score. Then add up the
resultant crossproducts.

Once you do these five things, you are ready to work the equation.
If you are careful and systematic in entering values and manipulating
them arithmetically, there is no reason to anticipate anything but
perfection in your final answer.







Example

Compute the correlation coefficient for the following test scores.
(Remember, for present purposes, the formula does not have to be
memorized; refer to it freely without experiencing guilt.)

Child    Arithmetic (X)    Reading (Y)
A              3                6
B              2                4
C              4                4
D              6                7
E              5                5
F              1                3

Obtain $\sum X$ and $\sum Y$ by summing down the appropriate columns. Add new,
appropriate columns for $X^2$, $Y^2$, and $XY$ as shown. By summing down such
columns, we obtain the desired formula entries. Also, note that
some of the formula entries must be multiplied by n, the number of
scores (6).



Arithmetic (X)    Reading (Y)    $X^2$    $Y^2$    $XY$
      3                6            9       36      18
      2                4            4       16       8
      4                4           16       16      16
      6                7           36       49      42
      5                5           25       25      25
      1                3            1        9       3

$\sum X = 21$    $\sum Y = 29$    $\sum X^2 = 91$    $\sum Y^2 = 151$    $\sum XY = 112$

Now we can begin to simplify the equation; we start with:

$$r = \frac{n\sum XY - \sum X \sum Y}{\sqrt{n\sum X^2 - (\sum X)^2}\;\sqrt{n\sum Y^2 - (\sum Y)^2}}$$

Substituting for n, we get:

$$r = \frac{6\sum XY - \sum X \sum Y}{\sqrt{6\sum X^2 - (\sum X)^2}\;\sqrt{6\sum Y^2 - (\sum Y)^2}}$$

Substituting for $\sum XY$, $\sum X$, $\sum Y$, $\sum X^2$, and $\sum Y^2$ we get:

$$r = \frac{(6)(112) - (21)(29)}{\sqrt{(6)(91) - (\sum X)^2}\;\sqrt{(6)(151) - (\sum Y)^2}}$$

Substituting for $(\sum X)^2$ and $(\sum Y)^2$, that is, (21 x 21) and (29 x 29), we get:

$$r = \frac{(6)(112) - (21)(29)}{\sqrt{(6)(91) - 441}\;\sqrt{(6)(151) - 841}} = \frac{63}{(10.25)(8.06)} = .76$$

(a fairly high positive relation)



One More:

A teacher wishes to correlate number of absences during the week
with end-of-week quiz scores. The data he/she obtains are shown below.

Note: For practice problems we are restricting sample size considerably for
the purpose of facilitating computations. Also, you may have noted
that in all problems, one-digit numbers have been used for score values.
Again, this is only to make the computations easier. In actual practice,
you may be dealing with test scores that are expressed in two or even
three digits, and thus, computing r's should not ever be attempted without
a calculator or computer.

Child    Absences (X)    Quiz Score (Y)    $X^2$    $Y^2$    $XY$
A             5                1             25        1       5
B             0                5              0       25       0
C             1                4              1       16       4
D             3                3              9        9       9
E             2                3              4        9       6

$\sum X = 11$    $\sum Y = 16$    $\sum X^2 = 39$    $\sum Y^2 = 60$    $\sum XY = 24$

$$r = \frac{n\sum XY - \sum X \sum Y}{\sqrt{n\sum X^2 - (\sum X)^2}\;\sqrt{n\sum Y^2 - (\sum Y)^2}} = \frac{(5)(24) - (11)(16)}{\sqrt{(5)(39) - 121}\;\sqrt{(5)(60) - 256}} = \frac{120 - 176}{57.06} = -.98$$

(high negative relation)
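Since the raw score formula is best left to a calculator or computer, here is a minimal Python sketch of it, added for illustration. It computes the five sums listed earlier and substitutes them into the formula. The function name `pearson_r` is hypothetical; the data are the two worked examples from this unit. The sketch assumes neither variable is constant (a constant variable would make the denominator zero).

```python
import math


def pearson_r(xs, ys):
    """Raw score formula for r, built from the five required sums."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))

    numerator = n * sum_xy - sum_x * sum_y
    denominator = (math.sqrt(n * sum_x2 - sum_x ** 2)
                   * math.sqrt(n * sum_y2 - sum_y ** 2))
    return numerator / denominator


# First example: arithmetic and reading scores for six children.
arithmetic = [3, 2, 4, 6, 5, 1]
reading = [6, 4, 4, 7, 5, 3]

# Second example: absences and quiz scores for five children.
absences = [5, 0, 1, 3, 2]
quiz = [1, 5, 4, 3, 3]

r_first = pearson_r(arithmetic, reading)
r_second = pearson_r(absences, quiz)
print(f"arithmetic vs. reading: r = {r_first:.2f}")
print(f"absences vs. quiz:      r = {r_second:.2f}")
```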



Correlation Coefficient for Ranked Data: $r_s$

On occasion, the data you wish to correlate may be presented in the
form of ranks rather than "regular" (i.e., interval or ratio) scores.
For example, suppose that instead of receiving scores on math and
spelling tests, students were rank-ordered by their teachers in each
subject. Mary, for example, was ranked #1 in math and #5 in spelling;
Charles was #6 in both. Ranks, just like numerical scores, can be
correlated. In fact, a British psychologist named Charles Spearman derived
a special formula for use with ranks. We call the correlation
coefficient that it yields the Spearman correlation coefficient (what else?),
and symbolize it as $r_s$ (note the "s" stands for Spearman. Tricky!).
For all practical purposes, $r_s$ behaves like, and is directly comparable
to, plain old r. The formula is as follows:

$$r_s = 1 - \frac{6\sum D^2}{n(n^2 - 1)}$$

where D refers to the difference between each pair of ranks, and 6 is a
constant that is always applied. The symbol n, of course, refers to the
number of pairs of ranks.

Here's an example of how students were ranked in (1) neatness and
(2) popularity by their homeroom teacher, Mrs. Fylnkleworth:

Student    Neatness    Popularity     D     $D^2$
Bluto          6            6          0      0
Gloria         1            2         -1      1
Frank          5            4          1      1
Aaron          2            1          1      1
Sylvia         4            3          1      1
Grace          3            5         -2      4

                                  $\sum D^2 = 8$

Substituting:

$$r_s = 1 - \frac{6(8)}{6(36 - 1)} = 1 - \frac{48}{6(35)} = 1 - \frac{48}{210} = 1 - .229 = .77$$

All we did was: (1) take the difference (D) between each person's
rankings, (2) square those differences ($D^2$), and (3) sum them to get
$\sum D^2$. The rest simply involves substituting this value and that for n
in the formula. Our result was $r_s$ = .77, a fairly strong positive
correlation. For this particular class, rankings in popularity tended to
be associated with rankings in neatness.
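The same three steps can be written as a short Python sketch, added here for illustration. The function name `spearman_rs` is hypothetical, and the sketch assumes each list already holds ranks, one pair per student, in the same student order.

```python
def spearman_rs(ranks_a, ranks_b):
    """Spearman formula: 1 - 6 * (sum of squared rank differences) / (n(n^2 - 1))."""
    if len(ranks_a) != len(ranks_b):
        raise ValueError("each student needs a rank on both variables")
    n = len(ranks_a)
    # Steps 1-3: take each difference D, square it, and sum the squares.
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - (6 * sum_d2) / (n * (n ** 2 - 1))


# Ranks for Bluto, Gloria, Frank, Aaron, Sylvia, and Grace.
neatness = [6, 1, 5, 2, 4, 3]
popularity = [6, 2, 4, 1, 3, 5]

rs = spearman_rs(neatness, popularity)
print(f"r_s = {rs:.2f}")
```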

Note: Suppose the teacher couldn't decide who to rank lowest
in popularity, Bluto or Frank. If the two boys are tied,
what rank should each receive? Answer: Assign each the
average of the ranks involved in the tie. In this case,
(Ranks 5 + 6)/2 = 5.5 for each.

Suppose two people tie for first? What rank do they get?
(Ranks 1 + 2)/2 = 1.5 for each.

Bonus: Suppose three people tie for 2nd?
(Ranks 2 + 3 + 4)/3 = 3 for each.
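The averaging rule for ties is easy to mechanize when ranks are being derived from scores. The Python sketch below is an added illustration, not the book's procedure. It assumes that rank 1 goes to the highest score; the function name `average_ranks` and the sample scores are hypothetical.

```python
def average_ranks(scores):
    """Rank scores from highest (rank 1) to lowest.

    Tied scores each receive the average of the ranks they jointly occupy.
    """
    ordered = sorted(scores, reverse=True)
    rank_of = {}
    for value in set(scores):
        first = ordered.index(value) + 1   # first rank position held by this score
        count = ordered.count(value)       # how many people share the score
        # Average of first, first + 1, ..., first + count - 1.
        rank_of[value] = first + (count - 1) / 2
    return [rank_of[s] for s in scores]


# Two people tie for first.
print(average_ranks([10, 10, 8]))

# Three people tie for 2nd.
print(average_ranks([90, 85, 85, 85, 70]))

# Two people tie for last among six.
print(average_ranks([9, 8, 7, 6, 2, 2]))
```

Once ties are resolved this way, the resulting ranks can be entered into the Spearman formula like any others.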






Final Notes:

1. The Pearson correlation coefficient (r) measures the degree of
relationship between two variables. Two formulas for its
computation were described, the z score formula and the raw score
formula.

2. If you wish to correlate ranked data, use the Spearman formula ($r_s$).
The values it gives have the same range (-1.00 to +1.00) as the
Pearson and are interpreted in the same way.

3. Speaking of interpretation, what constitutes strong or weak
correlations? Well, that really depends on what you're correlating.
In educational and behavioral research, correlations greater
than .50 in absolute value are generally considered large. In
the physical sciences, where things are supposed to operate in an
orderly fashion, correlations as large as .90 may be considered
low. Don't think of correlations as proportions or percentages;
.80, for example, isn't 80% as strong as its maximum level. As
you gain experience in using correlation in your field, you will
begin to gain a perspective on what levels to expect. When you
get to Unit IX in this book, you'll find that there are ways of
testing the statistical significance of correlations, determining
the probability that the coefficient you obtain differs from zero,
given your sample size and other factors. When you get to higher
level statistics courses, you'll find how correlation relates in
mathematical and conceptual ways to numerous analytical procedures
(multiple regression, discriminant analysis, factor analysis,
analysis of covariance). An exciting world awaits you (after you
part with us).

•5 POSTTEST (answers in back)

1. Give a logical explanation for why the z score correlation formula's 
mathematical properties will yield: 

a) a positive correlation when variables are directly (positively) 
related. 

b) a negative correlation when variables are inversely related. 

c) no correlation when variables are unrelated. 

2. A school administrator is interested in the relation between parents'
   income and the scores their children obtain on a nationally
   administered scholastic aptitude test for third graders. He randomly
   selects nine students from his district and tabulates their parents'
   annual incomes along with their test scores in z score form. The
   resultant data are shown below. What is the correlation between these
   variables? (Carry out this problem to completion.)






   Student     Parents' Income (z_x)     Test Score (z_y)

      A              -1.50                   -1.20
      B              -1.20                   -1.50
      C              - .80                   - .40
      D              - .40                   + .80
      E                .00                     .00
      F              + .40                   - .80
      G              + .80                   + .40
      H              +1.20                   +1.50
      I              +1.50                   +1.20

   Feel free to turn back and re-examine the z score formula.



3. The table shown below gives the raw scores 5 students received on
   reading and spelling tests. Compute the correlation coefficient
   for these measures. (This problem does not have to be carried out
   to completion. It will suffice if you merely enter the appropriate
   numerical values into the raw score equation.)

   Student     Reading (X)     Spelling (Y)

      J            3               5
      K            1               2
      L            2               5
      M            4               3
      N            1               1

4. In the Miss Soybean Contest, the 7 finalists were rank-ordered on
   the basis of congeniality and muscle tone. Use the Spearman formula
   to determine the correlation between the two rankings. What do the
   results imply?

   Contestant     Congeniality     Muscle Tone

       A               1               6
       B               3               4
       C               6               5
       D               4               3
       E               7               2
       F               2               7
       G               5               1






Unit V Review Test
(answers in back)



Select one answer for each of the following:

_1. Correlational analysis is used to

    a. determine the extent to which variables can be
       predicted from constants.

    b. determine the effect of the independent variable on
       the dependent variable.

    c. determine the relationship between two variables.

    d. determine the relationship between two constants.

    e. none of the above.

_2. Which of the following statements is true given that a
    correlational study is to be conducted using 30 people (n = 30)?

    a. Half of the observations (i.e., 15) will provide scores
       on variable X, whereas the other half will provide scores
       on variable Y.

    b. The number of points on a scatterplot will be 30.

    c. The sample size will be 30.

    d. The size of the sample is insufficient to permit a
       correlational analysis.

    e. b. and c. are correct.

_3. A correlation is determined for the length of days (sunrise
    to sunset) and the daily outside temperature at 12:00 noon.
    The units of analysis (i.e., the sample from which data are
    obtained) would consist of

    a. people

    b. days

    c. thermometers

    d. clocks

    e. none of these

_4. The pattern represented on a particular scatterplot shows
    points that slope downward from the upper lefthand side of
    the graph to the lower righthand side. Many of the points
    deviate from positions that would make up a straight line
    pattern. Based on this scatterplot, one could conclude that
    the correlation between the variables involved was

    a. a perfect positive

    b. an imperfect positive

    c. a perfect negative

    d. an imperfect negative

    e. zero






_5. Which of the following sets of variables would yield a
    correlation of -1.00?

    a. Height and gymnastics ability for a sample of eighth-grade
       girls.

    b. Driving speed and gas consumption for a sample of cars
       traveling the same 10-mile stretch on an interstate
       highway.

    c. Body weight and physical coordination for a sample of
       adults who consumed 12 ounces of alcohol.

    d. Distance already traveled and distance still remaining
       to finish for a sample of drivers two hours into a 500
       mile race.

    e. Number of pounds of food consumed during a week and total
       calorie intake for a sample of high school students.

_6. For a perfect positive correlation to be obtained, which of
    the following conditions must prevail?

    a. The highest person on variable X must also be the highest
       on variable Y.

    b. When plotted, all points must fall on a straight line.

    c. The direction of the scatterplot must be from lower left
       corner to upper right corner.

    d. The lowest person on Y must also be the lowest on X.

    e. All of the above.

_7. If, for a sample of 1000 physically normal adults, right leg
    length were to be correlated with left leg length, the resultant
    coefficient would probably be closest to

    a. -.60

    b. -1.00

    c. +.54

    d. +.98

    e. -.01

_8. A businessman finds that the correlation between age and
    productivity of his employees is = .50. A correct conclusion
    would be

    a. Age is a cause of good productivity.

    b. When you produce more, you age faster.

    c. Age and productivity are related.

    d. Young workers tend to be more productive.

    e. Older workers should be retired immediately.

_9. Number of hours devoted to study correlates with high school
    grades at +.20. Number of TV shows watched at night correlates
    with high school grades at -.40. If these were "real"
    results, which variable would be the better predictor of grades?

    a. Study hours

    b. Number of TV shows

    c. Both would be equally effective

    d. An answer cannot be determined without additional data.







_10. Suppose that running endurance (i.e., distance one can
     run at the pace of 9 min. per mile) correlates positively
     with the amount of time one can hold his breath under
     water. Tim is a person who can run great distances at
     9 min. per mile, but can hold his breath for only a fairly
     short time. His addition to the sample would

     a. weaken the correlation.

     b. strengthen the correlation.

     c. have no effect on the correlation.

     d. an answer cannot be determined without additional
        data.






Check the answers in the back and then rate yourself.

        Number Correct          Scoreboard

            9 - 10              Excellent
            7 - 8               Fair-Good
            5 - 6               Poor
            Below 5             Horrendous




PROBABILITY



General Objectives



     Remember Unit IV on the normal curve? Whether or not you realized it
at the time, the area problems you were working are probability problems,
and the normal curve is, in essence, a probability model. Given the
importance of probability to statistics (and you will see this come into
play quite strongly in Units VII-VIII), this unit provides an orientation
to this subject. We're betting that you'll "probably" benefit from the
experience.

Specific Objectives

6.1  Define probability

6.2  Compute probabilities on a single trial using (a) addition rule
     for mutually exclusive events and (b) addition rule for
     nonmutually exclusive events

6.3  Compute probabilities over multiple trials using (a) multiplication
     rule for independent trials, and (b) multiplication rule
     for dependent trials

6.1 Instructional Unit: Probability: What is it?

     Let us get right to the point by presenting a definition:

          The probability that a particular event will occur
          is the relative frequency with which that event can
          be expected to occur.

     Sound a bit circular? The key term here is relative frequency.
If there are 30 patients seeking medical attention and 12 are males, the
relative frequency of males is 12/30 or 2/5 or .40. We refer to the
total group as the sample space, the total number of events in question.
So, we determine probability by the formula:

          P(A) = n(A)/n

Where n(A) is the number of instances of Event A, and n is the size of
the sample space. P(A), therefore, represents the relative number of
times Event A is represented in the sample space, i.e., its relative
frequency.
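As a small illustration of the formula P(A) = n(A)/n, here is a Python sketch using the 30-patient figures from the paragraph above. The function name is invented for this example; exact fractions are used so the answer can be read either as 2/5 or as .40.

```python
from fractions import Fraction


def probability(n_event, n_sample_space):
    """Relative frequency of an event: P(A) = n(A) / n."""
    if n_sample_space <= 0 or not 0 <= n_event <= n_sample_space:
        raise ValueError("need 0 <= n(A) <= n and n > 0")
    return Fraction(n_event, n_sample_space)


# 30 patients, 12 of whom are males.
p_male = probability(12, 30)
print(p_male, float(p_male))  # 2/5 0.4
```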

     Unlike previous units, this one will de-emphasize discussion and
rely mostly on presenting the material through rules and examples. Hope
you like it!




Example 1: To return to our original example, but using different
numbers, suppose that a group of 50 patients contains 12 who need
immediate medical attention. What is the probability that if a
patient is selected at random to be treated first, he/she will be
one of those 12? What we're really asking is "what is the relative
frequency of patients needing immediate attention (in that group)?"

     P(Im. Atten.) = n(Im. Atten.)/n(all patients)

Since n(Im. Atten.) = 12 and since "all patients" or n = 50, we
substitute to get:

     P(Im. Atten.) = 12/50 = 6/25 or 24%

Thus, there is a 24% chance (6/25) of selecting one of those patients.

Example 2: Teachers need to be aware of guessing factors on their tests,
as guessing could produce invalid results for certain students. Suppose
that Mrs. Jones asks a multiple-choice question that has one correct
answer and four distractors (incorrect answers). Moose Crenshaw, who is
allergic to studying, takes a random guess by picking alternative "b."
Since we don't know the actual position of the correct answer, what can
be said about the probability of Moose's guess being correct? Set-up is

     P(correct) = n(correct)/n

Where n(correct) is, in this case, the number of correct alternatives
(there's 1) and n is the total number of alternatives (5).

     Ans: P(correct) = 1/5 or 20%

Moose's chances aren't good. He'd be better off with a true-false test.

Example 3: Medical personnel always have to worry about the condition
of their equipment, which is often beyond their control. In a new
shipment of 15 blood pressure gauges, 8 are defective and give slightly
inaccurate readings. Nurse Needles selects one at random to take a reading
on patient Jones. What is the probability that Needles has picked one
of the defective instruments?

     Ans: P(Defect) = n(Defect)/n
          P = 8/15 or 53.33%




6.1 POSTTEST

1. Define what probability is in your own words.

2. In a class consisting of five students, two are females and three
   are males. What is the probability of selecting a male student
   if a last name is called out at random?

3. In a group of 60 people, 10 have type O blood, 30 have type A blood,
   10 have type B blood, and 10 have type AB blood. If one of them
   gets ill and needs a transfusion, what is the probability that the
   type of blood needed will be type O?

6.2 Instructional Unit: Computing Probabilities on a Single Trial

     Before beginning our discussion of single trial probability, it
would probably be helpful to call your attention to some key vocabulary
that will appear in definitions of the two rules to be covered. We will
define them here once and then again when the rules are actually
introduced. It may not be that important to study them (or worse, memorize
them) now. They are presented mostly to provide an organizer of sorts:

     Mutually Exclusive: A condition that prevails when two (or more)
     events cannot occur together; i.e., cannot take place at the same
     time.

     Example: You purchase a prescription from your pharmacist.
     The specific medication purchased can be a liquid or a pill, but
     not both a liquid and a pill (at least at the same time).

     Nonmutually Exclusive: Well, would you be satisfied if we said
     it is the opposite of mutually exclusive? Probably not.
     Nonmutually exclusive refers to a condition where the events in
     question can occur together, i.e., can take place at the same time.

     Example: You are interested in the results of tests involving
     the effects of new drugs in treating laryngitis. You are also
     interested in the effects of drugs for treating bronchitis. You
     select a report on a particular drug at random. Could it be a drug
     that is effective for treating both laryngitis and bronchitis? Of
     course...those conditions are nonmutually exclusive.

Rule 1: Addition Rule for Mutually Exclusive Events

     When events A and B are mutually exclusive, the probability that
     either A or B will occur is P(A or B) = P(A) + P(B)

Thus, all we are doing is adding the separate probabilities associated
with events A and B.

Example 1: In a math class of 20 students, 5 are high achievers, 12 are
middle achievers, and 3 are low achievers. If a student is selected at
random, what is the probability that he/she will be either a high achiever
or a middle achiever?

     P(High or Middle) = P(High) + P(Middle) = 5/20 + 12/20 = 17/20
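Rule 1 amounts to a single addition, which the short Python sketch below carries out for the math class example. The function name is made up for illustration, and the check that the total does not exceed 1 is an added safeguard, not something the rule itself requires you to compute.

```python
from fractions import Fraction


def p_either_exclusive(*probabilities):
    """Rule 1: for mutually exclusive events, P(A or B) = P(A) + P(B)."""
    total = sum(probabilities, Fraction(0))
    # Mutually exclusive events share no members, so the total can never pass 1.
    if total > 1:
        raise ValueError("these events cannot all be mutually exclusive")
    return total


class_size = 20
p_high = Fraction(5, class_size)     # 5 high achievers
p_middle = Fraction(12, class_size)  # 12 middle achievers

p_high_or_middle = p_either_exclusive(p_high, p_middle)
print(p_high_or_middle)  # 17/20
```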




Example 2: A medical staff has 6 wards under its responsibility, 4
involving geriatric patients and 2 involving pediatrics. If a
supervisor randomly selects one ward to inspect, what is the probability
of its involving either geriatrics or pediatrics?

     P(P or G) = P(P) + P(G) = 2/6 + 4/6 = 6/6 = 1 or 100%

     (Since all the wards involve either pediatrics or geriatrics,
     the probability is 1 (100%) that one of those types will be
     inspected.)

Example 3: There are 12 student desks in the classroom: 3 have broken
seats and 5 others have broken tops. If a student selects a desk at
random, what is the probability of either its seat or top being broken?

     P(Seat or Top) = P(Seat) + P(Top) = 3/12 + 5/12 = 8/12 = 2/3

Example 4: Of every 100 maternity patients who are interviewed by the
nursing staff at City Hospital, 50 indicate the desire for natural
childbirth, 25 prefer general anesthesia, and 25 prefer local anesthesia.
A nurse meets a patient and, without knowing her position, makes
derogatory comments about using anesthesia. What is the probability of that
patient being one who actually prefers either local or general anesthesia?

     P(Local or Gen.) = P(L) + P(G) = 25/100 + 25/100 = 50/100 or 1/2 or 50%

Example 5: A teacher grades test results such that of her 21 students,
5 receive A's, 6 receive B's, 6 receive C's, 2 receive D's, and 2 receive
F's. On this particular test, what was the probability of receiving
either a D or an F?

     P(D or F) = P(D) + P(F) = 2/21 + 2/21 = 4/21

Rule 2: Addition Rule for Nonmutually Exclusive Events

     When events are nonmutually exclusive, P(A or B) = P(A) + P(B) - P(A+B)

     Once again, we are determining the probability of selecting either
A or B on a single draw. The change, however, is due to the condition
of events A and B not being mutually exclusive; i.e., they can occur
together. As a result, a new expression, P(A+B), enters into the
formula. It refers to the probability of the outcome containing both A and
B. The logic of the formula should become clearer as we work some
examples.

Example 1: In a staff of 25 physicians, 5 have had special training in
cardiac arrest (CA) and 10 have had special training in second and third
degree burns. Of these, 3 have had special training in both. If a
physician is selected at random, what is the probability that he/she
has had training in CA or serious burns or both?






     P(CA or Burns) = P(CA) + P(Burns) - P(Both)

                    = 5/25 + 10/25 - 3/25 = 12/25 or 48%

Why is the P(Both) subtracted? Those 3 physicians are part of the
group of 5 with CA training, right? But, they are also part of the
group of 10 receiving burn training, right too! So we are
subtracting them so they won't be counted twice. Look below.




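[Figure: two overlapping circles, one for CA training and one for burn
training. Labeled regions: CA ONLY and BURNS ONLY = 7, with the overlap
holding the 3 physicians trained in both.]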





Example 2: You have a class of 14 students, 3 of whom have problems in
reading and 6 have problems in math. Of these, 2 have problems in both
reading and math. What is the probability of selecting a student who has
problems in reading or math or both?

     P(Read. or Math) = P(Read.) + P(Math) - P(Both)

                      = 3/14 + 6/14 - 2/14 = 7/14 or 50%

Again, do you see why the P(Both) is subtracted? Those 2 students help
make up the group with reading problems and also the group with math
problems. We subtract them so that they're not counted twice.
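One way to convince yourself that subtracting P(Both) prevents double counting is to list the students and count them directly. The Python sketch below does this for the 14-student class; the student numbers are arbitrary labels invented for the illustration, chosen only so that 3 have reading problems, 6 have math problems, and 2 have both.

```python
from fractions import Fraction

# 14 students, labeled 1-14 (the labels themselves are hypothetical).
students = set(range(1, 15))
reading_problems = {1, 2, 3}            # 3 students
math_problems = {2, 3, 4, 5, 6, 7}      # 6 students; 2 and 3 appear in both sets

n = len(students)
p_read = Fraction(len(reading_problems), n)
p_math = Fraction(len(math_problems), n)
p_both = Fraction(len(reading_problems & math_problems), n)

# Rule 2: P(A or B) = P(A) + P(B) - P(A+B)
by_rule = p_read + p_math - p_both

# Direct count: each student in the union is counted exactly once.
by_counting = Fraction(len(reading_problems | math_problems), n)

print(by_rule, by_counting)  # 1/2 1/2
```

Both routes give 7/14, which is why the rule and a careful head count always agree.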




Example 3: There are 20 post-operative patients being monitored.
Following rounds, 8 are identified as showing abnormal respiration
patterns and 8 are identified as having uneven pulse rates. Of
these, 5 are evidencing both types of problems. Naturally, such
problems are cause for concern. What is the percentage (probability)
of patients having respiratory problems or pulse rate problems or
both?

     P(Resp. or Pulse) = P(R) + P(P) - P(Both)

                       = 8/20 + 8/20 - 5/20 = 11/20 or 55%

Example 4: Four types of bacteria are identified in a sample of city
drinking water: 1 is shown to produce acute gastroenteritis and 2
produce allergy-type symptoms. Of these, 1 produces both
gastroenteritis and allergy-type symptoms. If you were told that one type
of bacterium was concentrated in your water, what is the probability of
it being a type that produces gastroenteritis or allergy-type symptoms
or both?

     P(Gastro. or Allergy) = P(G) + P(A) - P(Both)

                           = 1/4 + 2/4 - 1/4 = 2/4 = 1/2

Example 5: Of 64 third graders, 5 have a history of frequent absenteeism
and 15 have a history of behavior problems. Of these, 3 have a history
of both types of problems. For this student population, what is the
probability of a student having a history of attendance problems, behavior
problems, or both?

     P(Absentee or Behav.) = P(Absentee) + P(Behav.) - P(Both)

                           = 5/64 + 15/64 - 3/64

                           = 17/64 or 27%

6.2 POSTTEST (answers in back)

1. Indicate whether each of the following pairs of categories (events)
   would be mutually exclusive or nonmutually exclusive for individuals.

   a. The sex of patients in a ward: male or female.

   b. Courses that A's are received in: history or math.

   c. Membership in different medical societies: national or state.

   d. Grade on yesterday's math exam: A or B or C.

   e. Type of class scheduled for 1 p.m. today: English or home
      economics.




2. Find the probability for each of the following:

   a. Of a D or an F in biology, given that in a class of 25 students,
      F's = 3, D's = 5, C's = 7, B's = 1, and A's = 9.

   b. Of a student participating in a sport or a club or both after
      school, given that in a sophomore class of 210, 22 students are
      in clubs, 60 play sports, and 2 participate in both.

   c. Of a person exposed to a particular pathogen developing anemia,
      ulcers, or both, given that out of 65 people formerly exposed,
      8 developed anemia, 10 ulcers, and 8 developed both.

   d. Of an incorrect temperature reading being taken, given that of
      15 available thermometers, 10 are accurate, 3 give overestimates,
      and 2 give underestimates.

3. Explain in your own words why it is necessary to subtract P(Both) in
   problems involving nonmutually exclusive events.



6.3 Instructional Unit: Computing Probabilities Over Multiple Trials

     This section presents rules for computing the probability that each
of a series of events will occur across trials. This description could
make a relatively simple idea sound confusing, so let's consider some
examples.

     In the previous unit we restricted ourselves to one-trial possibilities
such as: "Given 20 drugs of which 10 are effective, what is the
probability of picking an effective one at random?" One trial (i.e., one pick)
is involved.

     But, suppose we said that there are two sets of drugs in different
rooms, each containing 10 effective ones out of 20. What are the chances
of randomly picking an effective drug in both? Now two trials (picks)
are involved. Based on what we learned before, we could figure out that
for Set 1:

     P(Effective) = 10/20 or 50%

     The same goes for Set 2. But, still what is the probability that
an effective drug will be selected from both?

     Or, suppose Moose Crenshaw (remember him?) takes a random guess on
two multiple choice items in a row. If an item has one correct answer
and four distractors, we know that the chances of a lucky guess on one
item is 1/5 or 20%. But, what are his chances of getting both right?
The difference here is that we have two trials (or probability situations)
going, rather than only one.






     Now, just one more idea. Try hard on this one. Trials can be
independent or dependent. If independent, what occurs on one does not
affect what occurs on the other. The two examples above indicate
independent trials. Why? If you happen to pick an effective drug in
Set 1, that has no effect on your chances of picking one in Set 2.
Given that Moose makes a completely random guess on both items, his
success (or lack of it) on one item has no bearing on his fortunes on
the second. The two trials are independent.

     Dependent trials, on the other hand, involve situations where what
occurs on one trial does have an effect on what occurs on the other.
Suppose you make two selections from one class of 20 students. What
are the chances that both will be male if the class has 10 males and
10 females? For Selection #1, we have

     P(Male) = 10/20 = 50%. But, suppose a male is picked on that trial?

For Selection #2, there are only 19 students left! And, there are only
9 males left! Why? Because one student, a male, has already been picked.
So here, what happens on one trial does affect the probability results
for the other.



Rule 3: Multiplication Rule for Independent Trials

     Let's concentrate now on the independent trials case and return to
dependent trials when we cover our next rule.

     The formula (Rule) for determining probabilities on independent
trials is:

     P(A) · P(A) · ... · P(A) = P(A)ⁿ

     n = number of trials

Thus, we raise the probability of A to a power equivalent to the number
of trials.

If P(A) = 1/5, what is the chance that A will occur on two trials in a
row?

     Ans.: P(A)² = (1/5)² = 1/25 or 4%

Let's try some more meaningful examples.

Example 1: A student makes a completely random guess on each of three
T-F items on a quiz. The probability of making a correct guess on any
one item is 1/2. What is the probability that he will make three
correct guesses?

     Probability = P(Correct Guess)³

                 = (1/2)³ = 1/2 · 1/2 · 1/2 = 1/8 or 13%

There is one chance in eight of guessing all three.
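Rule 3 is nothing more than raising a probability to a power, as the following Python sketch shows. The function name is invented for illustration; the two calls use the true-false example just worked and the P(A) = 1/5 case given with the rule.

```python
from fractions import Fraction


def p_all_independent(p_single, n_trials):
    """Rule 3: probability that an event occurs on every one of n independent trials."""
    return Fraction(p_single) ** n_trials


# Three random guesses on true-false items, each with a 1/2 chance.
p_three_right = p_all_independent(Fraction(1, 2), 3)
print(p_three_right, float(p_three_right))  # 1/8 0.125

# P(A) = 1/5 on two trials in a row.
print(p_all_independent(Fraction(1, 5), 2))  # 1/25
```

The exact value 1/8 is 12.5%, which the text rounds to 13%.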




Example 2: Based on statistics for a rare form of malaria, the chances
of contraction being lethal are one out of three. What is the
probability then that two individuals who independently contract the disease
will both die from it?

     Probability = P(Death)² = (1/3)² = 1/3 · 1/3 = 1/9 or 11%

Example 3: The probability of event A is 2/3. What is the probability
that A will occur in four out of four trials?

     Probability = P(A)⁴ = (2/3)⁴

                 = 2/3 · 2/3 · 2/3 · 2/3 = 16/81 or 20%

Example 4: The State Board of Education publishes figures indicating
that the chances of an applicant being accepted for a teaching position
in a typical school district are 2 out of 5 (2/5). Joe Smith submits
applications to two typical districts. What is the probability that he
will be offered a position in both?

     Probability = P(Accept.)² = (2/5)²

                 = 2/5 · 2/5 = 4/25 or 16%

Example 5: There are six cards, one of which is the Queen of Diamonds.
On a given draw, a card is selected at random and then put back. What
are the chances that if three draws are made the card will be the Queen
of Diamonds on all three?

     Probability = P(Q-D)³ = (1/6)³

                 = 1/6 · 1/6 · 1/6 = 1/(6 · 6 · 6)

                 = 1/216

Rule 4: Multiplication Rule for Dependent Trials

     Dependent trials, as described earlier, occur when what happens in
one trial affects the probability of the event in question taking place
on another trial. Don't panic quite yet.

     Suppose there is a class of 28 students. Fifteen of them are A
students in your book, a pleasure to teach. If 2 students from that
class were to transfer to new schools, what is the probability that both
will be among the A group?

     Our reasoning is as follows: For the first selection, we have n = 28
and A = 15. Thus, the probability of the first student to transfer being
one of the A's is:

     15/28 or 54%

One A student down, but to finish the probability estimation we need to
know the chances of both transfers being A's. Well, if the first is an
A, how many A's are left?

     Ans.: 14 out of the original 15.

How many students are left following the first transfer?

     Ans.: 27 out of the original 28.

What's the probability of A for transfer (or trial) #2?

     14/27 or 52%

See how the two trials (i.e., probability estimates) are dependent? Your
pick on Trial 1 affects the number of critical events and the size of
sample space on Trial 2. If an A student is the first transfer, isn't
there less chance that an A student will also be the second? Yes, there
will be relatively fewer left in the class.

     The formula for dependent trials is:

     Probability = P(A) · P(B)

Think of P(A) in this case as the probability of the event occurring on
Trial 1 and P(B) as the probability of its occurring on Trial 2, given
what happened on Trial 1. Thus, the formula refers to the probability of
the events happening in both Trials 1 and 2.

     Thus, in the example: Probability = P(A) · P(B)

                                       = 15/28 · 14/27 = (15 · 14)/(28 · 27)

                                       = 210/756 or 28%
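The same bookkeeping (one fewer critical event and one fewer member of the sample space after each pick) can be written as a short loop. This Python sketch is only an illustration with an invented function name; it assumes every pick so far has been one of the critical events, as in the class example.

```python
from fractions import Fraction


def p_all_without_replacement(n_favorable, n_total, n_draws):
    """Rule 4 for repeated picks: multiply the shrinking fractions together.

    After each successful pick, both the number of favorable cases and the
    size of the sample space drop by one.
    """
    if n_draws > n_total:
        raise ValueError("cannot draw more items than the sample space holds")
    p = Fraction(1)
    for already_picked in range(n_draws):
        p *= Fraction(n_favorable - already_picked, n_total - already_picked)
    return p


# 28 students, 15 of them A students, 2 transfers.
p_both_a = p_all_without_replacement(15, 28, 2)
print(p_both_a, round(float(p_both_a) * 100))  # 5/18 28
```

The fraction 5/18 is 210/756 in lowest terms, about 28%.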

Let's try some others:

Example 1: A matching test has 10 names in column A and 10 titles in
column B, with the task being to match one correct title to each name.
If a student has not read the materials and makes a completely random
title selection for name #1, and then a completely random selection of
the remaining titles for name #2, what is the probability that both
guesses will be correct? (We'll substitute C-1 and C-2 to indicate
"correct" on selections 1 and 2, respectively.)

     Let C stand for correct guess.

     Probability = P(C-1) · P(C-2)

Thus, the formula reads:

     "The probability of a correct guess on both Trials 1 and 2 equals
the probability of a correct guess on Trial 1 multiplied by the
probability of a correct guess on Trial 2." What is P(C-1)? That is,
what are the chances that the student will guess the correct title for
the first name? How many titles?

     Ans.: 10

     P(C-1) = 1/10 or 10%, right?

But now for the dependency.

     If he has guessed right on name 1, what is P(C-2)? How many titles
remain to choose from?

     Ans.: only 9; one has already been used. Thus,
     P(C-2) = 1/9 or 11.11%

     Does our student have a good chance of guessing the first two
correctly?

     P(C-1 and C-2) = P(C-1) · P(C-2)
                    = 1/10 · 1/9 = 1/90 or 1.11%

Not a very good chance, is there?

Example 2: Suppose the above matching test contained five states to be
matched to five state capitals. What would the probability be of
guessing the first three correctly?

     Don't be scared by the "three" probabilities. The logic is still
the same; just write the formula (C = correct guess):

     Probability = P(C-1) · P(C-2) · P(C-3)

We've just expanded it to include the third trial (item).

     P(C-1) = 1/5 or 20%
     P(C-2) = 1/4 or 25%

Understand why? If he's guessed the first one correctly, there are
only 4 remaining to choose from for his second guess.

     P(C-3) = 1/3 or 33.33%

and for his 3rd guess, there are only three to choose from.

     Ans.: 1/5 · 1/4 · 1/3 = 1/60 or 1.67%

Not too good a chance of guessing all three.
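The matching-test pattern (one correct choice out of however many remain) can be sketched as a loop in Python. The function name is hypothetical; the sketch assumes, as the examples do, that every earlier guess was correct, so exactly one choice is used up each time.

```python
from fractions import Fraction


def p_first_k_matches(n_items, k):
    """Probability of guessing the first k matches correctly out of n items.

    Each correct guess uses up one choice, so the denominators run
    n, n - 1, n - 2, and so on.
    """
    if not 0 <= k <= n_items:
        raise ValueError("k must be between 0 and n_items")
    p = Fraction(1)
    for already_used in range(k):
        p *= Fraction(1, n_items - already_used)
    return p


print(p_first_k_matches(5, 3))   # 1/60  (states and capitals)
print(p_first_k_matches(10, 2))  # 1/90  (names and titles)
```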

Example 3: There are 12 heart patients needing bypass operations. Dr.
Smith will be performing the operation on 2 of them. Among the 12 are
three very difficult cases involving patients of relatively advanced
ages. What is the probability that both of Dr. Smith's operations will
involve these problem individuals?

     Let P stand for "problem."

     P(P-1 and P-2) = P(P-1) · P(P-2)

What we're saying is that the probability of both operations (1 and 2)
involving problem patients (P's) is equal to the product of the
probabilities of a P being picked on two selections. But, those probabilities
are dependent, as should become obvious below:

     P(P-1) = 3/12 or 25%

Only 11 cases remain. Of those, only 2 are problems. Thus,

     P(P-1 and P-2) = 3/12 · 2/11 = 6/132 or 1/22 = 4.55%

Not much reason to panic. Odds aren't that great that Dr. Smith will
get two problem cases for his operations.

Example 4: Students are told not to put comic books in their desks. Out
of the 28 students in class, 20 obey that rule and remove comics from
their desks on a particular day. The teacher decides to check desks on
that day, but ends up being able to check only two before being called
away. What is the probability that both of the ones she checks are
proper (i.e., no comics)? Let's just use A to symbolize a "good desk."

     Probability = P(A-1) · P(A-2)

     P(A-1) = 20/28 = 5/7 or 71.43%

On first check, remember 20 out of 28 are good. But,

     P(A-2) = 19/27 or 70.37%

Understand how we get the 19? The 27?

     P(A-1 and A-2) = 5/7 · 19/27 = 95/189 or 50.26%

Thus, there is about a 50% chance that both desks will pass. Students
better pray that luck is with them.






Example 5: There are 5 cards in a pile. One is a heart and the
rest are clubs. If a card is selected at random and not put back,
what is the probability that the first two picks will not be hearts?
Let NH stand for "no heart."

     Probability = P(NH-1) · P(NH-2)

     P(NH-1) = 4/5 or 80%

See that there are 4 other cards available for the first selection.
There are 5 cards in the sample space.

     P(NH-2) = 3/4 or 75%

Once an NH is selected on first pick, only 3 NH remain in the resultant
sample of 4.

     Probability = 4/5 · 3/4 = 12/20 or 60%

There's a reasonably good chance (60%) that the two cards selected will
be other than hearts. To show you that our "heart" is in the right place,
here are some questions to practice on.
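If you would rather check the 60% figure by brute force than by formula, the Python sketch below lists every possible ordered pair of picks from the five cards and counts the pairs that contain no heart. The card list and variable names are invented for the illustration.

```python
from fractions import Fraction
from itertools import permutations

# One heart and four clubs; positions 0-4 stand for the five physical cards.
cards = ["heart", "club", "club", "club", "club"]

# Every ordered way to pick two different cards without putting the first back.
ordered_pairs = list(permutations(range(len(cards)), 2))

# Keep only the pairs in which neither card is the heart.
no_heart_pairs = [pair for pair in ordered_pairs
                  if all(cards[i] != "heart" for i in pair)]

p_no_heart = Fraction(len(no_heart_pairs), len(ordered_pairs))
print(len(no_heart_pairs), len(ordered_pairs), p_no_heart)  # 12 20 3/5
```

The count of 12 favorable pairs out of 20 matches 4/5 · 3/4 = 12/20 from the multiplication rule.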

6.3 POSTTEST (answers in back)

1. Indicate which of the following involve independent trials and
   which involve dependent trials.

   a. The probability of making a correct guess on each of a series
      of multiple-choice items.

   b. The probability of randomly matching each of a group of
      newborns to the correct set of parents.

   c. The probability of picking a diamond from a deck of cards,
      removing it, and then picking a diamond from the remaining
      cards.

   d. The probability of picking a heart from a deck of cards, putting
      it back, reshuffling, and then picking another heart.

2. Compute the probability for each of the following:

   a. In a class of 20 students, 18 have done their homework. The
      teacher randomly selects one student to work problems on the
      board, and then randomly selects another student. What is
      the probability that both will be ones who did their
      homework? (Are these independent or dependent trials?)

   b. Suppose that 18/20 students have done their homework. The
      teacher randomly selects a student to do a problem on the
      board. After the student sits down, the teacher randomly
      selects a student to do the next problem (note that the first
      student can be selected again). What is the probability that
      both selections will be students who have done their
      homework? (Are these independent or dependent?)



3. The chance of arresting growth of a particular type of pituitary
   tumor through chemotherapy is 1/4. What is the probability that
   in a series of different cases:

   a. the first two will be successful?

   b. the first three?

   c. the first four?

4. Given a conventional deck of 52 cards, what is the probability of:

   a. drawing a heart on Draw 1, putting the card back, reshuffling,
      and then drawing a heart on Draw 2?

   b. drawing a king on Draw 1, leaving it out of the deck, and then
      drawing a queen on Draw 2?

   c. drawing a diamond on Draw 1, leaving it out of the deck, and
      then drawing another diamond on Draw 2?

   d. (Bonus) drawing either a 10 or a jack on Draw 1, leaving it out
      of the deck, and then drawing either a nine or a king on Draw 2?






Unit VI Review Test
(answers in back)

There are 32 hospital centers in a foreign country. Of those, 5
have equipment for renal dialysis and 27 have no such equipment.
If a hospital is selected at random,

a. what is the probability that it will not have renal
   dialysis equipment?

b. what is the probability that it will have renal dialysis
   equipment?

c. what is the probability that it will either have renal dialysis
   equipment or not have such equipment?

Of 127 items on the State Achievement Test in English, 15 cover
antonyms and 20 cover conjugations, two areas you haven't covered
well in your course. If an item is selected at random, what is
the probability that

a. it will cover antonyms?

b. it will cover conjugations?

c. it will cover either antonyms or conjugations?

d. if two items are selected, both will cover antonyms? (Are
   these dependent or independent trials?)

e. if two items are selected, the first will cover antonyms
   and the second will cover conjugations?

Differentiate between

a. mutually exclusive events and nonmutually exclusive events.

b. independent trials and dependent trials.

Five American students are entered in the World Science Fair 
contest. Altogether there are 316 contestants. Without 
knowing anything about the projects (i.e., assuming all con- 
testants are equally likely to win before the contest starts), 
what is the probability that 

a. the winner will be an American?

b. both the winner and the second place contestants will be Americans?

c. the winner will not be American?

d. both the winner and the second place contestants will not be
   American?

e. if separate prizes are given for originality and for interest
   value, the winners of both prizes will be Americans (note:
   one individual can win both)?






UNIT VII
MAKING INFERENCES FROM SAMPLES 



A. General Objectives 

Statistics deals with what we can know about a large group
(or population) by examining a smaller group (or sample) from
that population. Of course, if complete population data are
available, then there is no need to work with samples. However,
due to the difficulty of measuring a population, samples are often
the only way. One way to think about this is as a direct route
(working with a population) and an indirect route (using a sample)
to the same destination. The concepts of random sample and
Sampling Distribution play important roles in taking the indirect
route, statistical inference, to our destination.

B. Specific Objectives

7.1 Define random sample and describe how to obtain a
    random sample

7.2 Explain the relationship between parameters and statistics

7.3 Explain the notion of bias with regard to Sampling
    Distributions

7.4 Characterize the properties of Sampling Distributions
    of means

7.5 Compute probabilities for obtaining sample means of
    various sizes when population parameters are known

7.1 Instructional Unit: Random Samples

As indicated above in the "general objectives" section, there
are many instances in which we want to describe a population of
individuals in terms of some characteristic: What is the average age of
Oklahomans? (Bet you've always wondered about that!) What percentage
of women voters in the United States are registered Republicans?
These are but two examples of a multitude of questions asked every
day about populations. The problem, however, is that to assess a
population directly, you would need to collect the information from
all members. See any problem with that? In fact, there are immense
problems involving time and expense. For the average researcher,
collecting complete population data is often impossible.

So, here's a riddle for you. You've been hired to determine the
mean and standard deviation of the Grade Point Averages (GPAs) of all
full-time students currently attending State University. Given the
time and resources at your disposal, it is impossible for you to test
the entire population of students, and the data are not presently
available anywhere. Yet, you're being asked to report population
results!! What should you do? Here are some choices:






a. get another job.

b. find the GPAs for the first 100 students you see on
   campus, and infer the population results from those.

c. find the GPAs for a representative (random) sample
   of 100 students and infer the population results from
   those.

d. all of the above.

This may surprise you, but the best answer is "c." If you cannot
test an entire population, the first step is to (1) obtain a random
sample. Then you (2) compute statistics for the sample (e.g., the
mean, variance), and then (3) make an inference about the population
on that basis. In this section, we will examine the first step:
obtaining a random sample.

Why random samples?

Statisticians use samples to estimate parameters (population data).
Why do they use random samples? If you do not know the answer, an
important clue should be provided if we say that randomness helps to
obtain representativeness.

Example 

Mrs. Pedogog, a high school guidance counselor, believes that the
students in her school are utterly brilliant, heads and shoulders above
the kids attending the neighboring schools. She decides that the best
way to test this hypothesis would be to compare the IQs of students in
her school with the average IQ of all students in the city, which she
knows to be 99. She does not have the time or money to test all
students in her school, so she decides to use a sample, from which she
can generalize to the school population. Since her office is next
to the physics laboratory, run by Mr. Molecule, she administers the
Stanford-Binet IQ test to every one of Mr. Molecule's third-period
(11:00-11:45) students. The average IQ turns out to be a whopping 1^ 
She runs through the halls shouting hysterically, "The kids in our
school are brighter than the average students in the city!"

Why is the above conclusion ridiculous? 

Answer: Mrs. Pedogog used a sample that was not representative
        of the student population in her school. The sample may
        be representative of Mr. Molecule's brilliant physics
        students, but it is certainly biased in reference to the
        total student population at Pedogog's school.

Moral of story: Samples save time and money, but in order to be 
valid, they must be representative of the population to 
which they are supposed to relate. Random samples help 
to engender representativeness. 






What exactly is a random sample?

or

What do I have to do to get one?

As the name implies, a random sample is obtained by a method
which guarantees randomness (huh?). The question now is what do
we mean by "randomness"? Actually, the name refers to how the
sample is selected rather than of what it is comprised. Several
samples presented with no information about how they were obtained
give us no basis for determining whether they are random or not.

The idea of "randomness" requires that every element (person,
place, or thing) in the population had an equal chance of being
selected for the sample. Confusing? Let's break it into steps using
the example involving Mrs. Pedogog.

You will recall that Mrs. Pedogog was interested in assessing
the "typical" IQ of the students in her high school. Rather than
test every single student (2,000 in all), she decided to use a sample,
and from the sample data generalize to the population.

To draw a random sample she must do the following:

1. Clearly define her population: Mrs. Pedogog does so by
   saying, "The population with which I am concerned
   consists of all students in Beaver High School."

2. Decide on an adequate sample size: This one is up for grabs.
   Obviously, the larger your sample, the better your estimate of
   population parameters. Mrs. Pedogog defines her sample size by
   saying, "I will use 30 students in my sample (30 is generally
   a pretty good number, but randomness does not directly depend
   upon sample size)."

3. Select her sample by ensuring that every student in the
   population has an equal chance of being chosen: Mrs.
   Pedogog does this by writing the name of every student
   in the high school on a piece of paper, putting all of the
   names in a drum, mixing the names, and having the principal
   of the school (with blindfold on) draw 30 names, one by one,
   from the drum.

Mrs. Pedogog's sample of 30 students is a random sample for the simple
reason that every element (student) in the population the sample is
intended to represent had an equal chance of being selected.

Did Joe Smith, who is repeating the 10th grade for the fifth time,
have the same chance of being selected as Boyd Buttrom, the math
genius? Yes, both names went into the drum on the same size piece
of paper. There is no reason for believing that any one student had a
greater chance of being selected than anyone else.

This example illustrates the basic criterion of random sampling:
EVERY ELEMENT IN THE POPULATION FROM WHICH THE SAMPLE IS DRAWN
HAS AN EQUAL CHANCE OF BEING SELECTED.
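
For the curious, the drum-and-blindfold procedure has a direct computer equivalent. The sketch below uses Python's standard `random` module; the roster of 2,000 made-up names, the function name `draw_random_sample`, and the seed are all assumptions for illustration, not anything Mrs. Pedogog actually used.

```python
import random

def draw_random_sample(population, sample_size, seed=None):
    """Draw `sample_size` distinct elements so that every element of
    `population` has an equal chance of being selected."""
    rng = random.Random(seed)  # a seed only makes the draw repeatable
    return rng.sample(population, sample_size)

# Hypothetical roster standing in for the 2,000 students.
students = [f"Student {i}" for i in range(1, 2001)]

sample = draw_random_sample(students, 30, seed=1)
print(len(sample), len(set(sample)))  # 30 30
```

`rng.sample` draws without replacement, just as a name pulled from the drum does not go back in.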






Another Example

If you wished to determine the mean height of a population
consisting of 3,000 college freshmen, you would have two choices:
(1) test all 3,000, or (2) test a random sample and estimate the
population mean.

Being of sound mind, you would elect the latter alternative,
keeping in mind that your sample (let's say 100 people) must be
random. One possible way to select a random sample would be to
sit down and write out all possible combinations of 100 names,
and then, closing your eyes, select one of the lists. But, as
you hopefully realize, the number of lists containing all possible
combinations of 100 names would be so large that it would take a
lifetime to write them all out. A more sensible alternative is to
write down each name on an individual sheet of paper; place the
papers into a container; and draw out one name at a time, making sure
that on the second draw any of the remaining 2,999 names has an
equal chance of being drawn, that on the third draw any of the
remaining 2,998 names has an equal chance, and so on until you
reach 100, the desired sample size.

Question: Is the above sample random?

Your answer: Yes, because every college freshman in the population
             (3,000 in all) had an equal chance of being selected:
             the 7' basketball center and the 5' class clown alike.

Still Another

Suppose we wanted to know whether college students at Ukaducca
University are in favor of the administration allocating more money
for the football program. We don't wish to question all of Ukaducca's
27,000 students, so we decide to use a sample. We select our sample
by going to the Saturday football game and questioning every tenth
student from Ukaducca that we find. We end up with a sample of 100
students from which we can generalize to the population.

Question: Is the above sample random?

Your answer: Clearly, it is not. Not all of Ukaducca's 27,000 students
             had an equal chance of being selected. Did
             a student who hates football and spends all her
             Saturdays in the library have the same opportunity as one
             who has never missed a game? Clearly not. Students
             who attend football games are not representative of
             the total student population.
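
To see how far off the football-game sample could be, here is a small simulation. Everything numerical in it is invented for illustration: the 30% attendance rate, the 80% and 30% approval rates, and the seed are assumptions, since the text gives no such figures. Only the 27,000 students, the "every tenth student" rule, and the sample size of 100 come from the example.

```python
import random

rng = random.Random(42)  # fixed seed so the run is repeatable

# Hypothetical population: each student is (attends_games, favors_funding).
students = []
for _ in range(27_000):
    attends_games = rng.random() < 0.30
    favors_funding = rng.random() < (0.80 if attends_games else 0.30)
    students.append((attends_games, favors_funding))

def share_in_favor(group):
    """Proportion of a group that favors more football money."""
    return sum(1 for _, favors in group if favors) / len(group)

true_share = share_in_favor(students)

# The method in the example: every tenth student found at the game.
at_game = [s for s in students if s[0]]
game_sample = at_game[::10][:100]

# A random sample: every one of the 27,000 has an equal chance.
random_sample = rng.sample(students, 100)

print(f"whole population: {true_share:.2f}")
print(f"football sample:  {share_in_favor(game_sample):.2f}")
print(f"random sample:    {share_in_favor(random_sample):.2f}")
```

Under these made-up rates the football sample overstates support by a wide margin, while the random sample lands near the population figure, give or take chance variation.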

A Final One 

Suppose a teacher wishes to obtain a random sample from his 
8:00 a.m. class of 100 students, and he selects the first 20 students 
who arrive on Monday morning. 






Question: Is the sample random?

Your answer: Obviously, it is not. What about the 15 or so
             students who cut class after the weekend? What
             about those students who are usually late to
             class? They did not have an equal chance;
             therefore, the teacher has selected a biased (not random)
             sample.



Final notes

1. Random samples are used in statistics because:

   a) From random samples you can validly generalize to
      the population with which you are concerned (but
      can never hope to test in its entirety).

   b) Many of the statistical procedures developed for use
      in hypothesis testing (to be covered in Unit IX)
      require random sampling. If your sampling procedures
      are not random, those statistical tests are invalid.

2. It may have bothered you in the discussion of random sampling
   procedures that the suggested method involved writing names on
   pieces of paper and drawing them from a drum. Actually,
   statisticians do not spend their time tearing up pieces of paper in
   the eternal quest for random samples. Two alternate sources
   are available. Just so you know about them, they are: (a)
   a random number table, available in many statistics texts
   (not yours!), and (b) computers, which generate random numbers
   for the asking.

3. The major point in this unit is that whether you write names on
   pieces of paper or use a random number table, you end up with a
   random sample if and only if all subjects (people, students, etc.)
   in the population you are measuring had an equal chance of being
   selected.



7.1 POSTTEST (answers in back)

1. Describe what is meant by random sample.

2. Why do statisticians use random samples?

3. Describe briefly how you could obtain a random sample of 50
   subjects from a population of 300.

4. A researcher is interested in determining the average yearly
   income of male citizens over 18 years of age who reside in
   Catatonia, Nevada (pop. 424). To save time, he uses a random
   numbers table to select a sample of 50 from the Catatonia
   phonebook.

   Will the resultant sample be random? Why or why not?



7.2 Instructional Unit: The Relationship Between Parameters and
    Statistics

In introducing this unit, some steps in the route of statistical
inference were briefly mentioned. They were: 1) drawing a random
sample, and 2) computing statistics of the sample.

You know what to do in step #1: drawing a random sample. You
know what to do in step #2: computing statistics of the sample
(computing means, modes, medians, variances, standard deviations,
etc.). Then, we want to move to a third step: to make inferences
from the sample about the population. Remember, as sampling has been
described in this unit, we have no interest in the sample per se
(i.e., the particular 100 college freshmen that we "happened" to
select out of the population of 3,000). We are interested in the whole
3,000(!) and are merely using the sample to get the best picture that
we can. But how do you estimate populations and test hypotheses
about those estimates? It's a fairly "giant" step which will take
at least another unit (Unit VIII to be exact) to even fully introduce.
But we must make a start somewhere, so we will begin by again
mentioning that the major function of samples is to provide a basis for
making inferences about populations. In instances where it is
reasonable to describe all members of a population (e.g., Mrs. Jones
wants to find the average age of her 10 fourth-grade students), the
use of samples would make little sense. In instances where it is
unreasonable to describe an entire population (the average age of all
fourth graders!), use of a sample is absolutely necessary.

Obviously, whether scores are from a sample or a population, it
is possible to compute the various types of statistics we have
discussed in earlier units. A mean or a median, for example, can be






obtained for either a sample or a population, using exactly the same
computation procedures. But perhaps you are getting the picture
that it is important to differentiate between the two types of data
sources. The indices to which we have referred as "statistics" are
actually labeled as such only when computed for samples. When the
same indices (means, modes, correlation, etc.) are computed for
populations, they are called parameters. Remembering this scheme
is easy: (s)amples yield (s)tatistics
         (p)opulations yield (p)arameters

Note the use of Greek letters when referring to populations and
Roman letters when referring to samples. This way the data source
is never in doubt. Here are some examples:

                        Population Parameter    Sample Statistic
Mean                             μ                      X̄
Standard Deviation               σ                      s
Variance                         σ²                     s²
Correlation                      ρ                      r

So, to say it again, we must rely on statistics as a means of
estimating population parameters. But will the computed statistic
necessarily give us the exact parameter of interest? The answer is "hardly
ever," but the validity of the estimate (i.e., its closeness to the
"true" parameter value) will depend on a number of factors. Here are
the most obvious:

1. Randomness of the sample: If your sample isn't random, the
accuracy of estimation will likely be reduced. For example, if the true
average height of the entire population of male seniors in North-South
High School is 68 in., it's doubtful that you would come even
close if the sample you employed for estimation consisted of all male
seniors on the basketball team. The probable mean value for that
sample would be something like 74 in., with the result being an
incorrect conclusion that male students grow especially big at North-South
High. A random sample would be more likely to give a mean value
much closer to the actual parameter, 68 in.

2. Sample size: Should sample size relate to the accuracy of
estimation? The answer is "positively." The larger the sample, the
greater your chances of obtaining a valid estimate. To use an extreme
example, suppose you were interested in estimating grade point average
(GPA) of students at North-South High, and decided to use a random
sample of two students. By the "luck of the draw," the two names
selected from the drum happen to be Joe Smitherene and Harry Duence,
whose best subjects are homeroom and locker room, respectively. When
you check the records to obtain GPAs, the computed value is 2.50 on
a 4.00 scale. But that is for both added together! Dividing by two






gives an average for that sample of 1.25. Unfortunately, this time
randomness did not provide a close estimate of the population parameter.
Someone else might select a different random sample of two, and by
chance, select Herb Einstein and Geraldine Cranium, who spent the night
of the Prom comparing equations for determining the stress load on the
dance floor. For this sample, the estimate is 3.99. Someone else
might use a sample of two and actually get the true population
parameter of 2.85. The point is that with such small samples, there is
much room for error (chance variation): even one extreme case can
radically affect the computed value. On the other hand, if a larger
sample, say 50 students, had been used, high extremes would probably
balance out low extremes, thus increasing the chances of coming very
close to the true parameter.
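
The effect of sample size can be watched in action with a short simulation. This is a sketch under stated assumptions: the 2,000 GPAs are generated at random around 2.85 with an invented spread of 0.6 (the text gives neither the school's size nor its spread), and the helper name `spread_of_sample_means` is made up.

```python
import random
import statistics

rng = random.Random(7)  # fixed seed so the run is repeatable

# Hypothetical school: GPAs on a 0.00 to 4.00 scale, centered near 2.85.
population = [min(4.0, max(0.0, rng.gauss(2.85, 0.6))) for _ in range(2000)]

def spread_of_sample_means(population, n, repeats, rng):
    """Draw `repeats` random samples of size n and report how much
    the sample means vary (their standard deviation)."""
    means = [statistics.mean(rng.sample(population, n)) for _ in range(repeats)]
    return statistics.pstdev(means)

spread_2 = spread_of_sample_means(population, 2, 1000, rng)
spread_50 = spread_of_sample_means(population, 50, 1000, rng)

print(f"spread of means, samples of 2:  {spread_2:.2f}")
print(f"spread of means, samples of 50: {spread_50:.2f}")
```

The means from samples of 50 bunch much more tightly around the population value than the means from samples of two, which is the "extremes balance out" idea in numbers.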

3. Sample Statistic: As will be discussed in more detail below,
certain statistics give better estimates of their corresponding
parameters than do others. For example, using a sample mean to estimate
a population mean will normally provide more accuracy than using a
sample median to estimate a population median. Further, both of these
statistics give more accurate estimations than would a sample range for
estimating the range for the population. The ways in which different
statistics are computed allow for some to vary more (from sample to
sample) than others. And some, like the sample range and correlation
coefficient, tend to systematically differ from their corresponding
population values.

4. Population Form: In using statistics to make inferences
about parameters, another important factor is the form of the
population. It will be seen later on in this unit, and also in the next,
that some of the procedures used for statistical inference-making
depend on assumptions regarding normality, i.e., scores being
distributed in normal curve form. Analytical procedures used when a
distribution approaches normality might not be appropriate when it is
rectangular, seriously skewed, bimodal, etc. The general rule in the
statistical world seems to be that the closer the population is to
normality, the easier life becomes.

5. Luck: Statistics are merely estimators of population
parameters. Even if you use the most conscientious random sampling
techniques (with a gold-plated wastepaper basket from which to draw
selections), a very large sample, a "reliable" statistic (e.g., the
mean as opposed to the range), and a perfectly normally distributed
population, there is no guarantee that the statistic you get will
provide a close estimate of the population parameter. The above
factors do guarantee precision in the long run, but in any single
sample there will always be a certain margin for error. The
hypothesis testing procedures to be dealt with later on take these error
margins into account in telling us how "confident" we can be about
our estimates. Although you are not expected, at this point, to
understand how the following figures would be derived, we might find,
for example, that the sample mean we obtain of 106 allows us to be






95% confident that the true population parameter lies between
102.04 and 109.96, and 99% confident that it lies between 100.84 and
111.16. All we are attempting to say at this point is that statistics
is a game of probability; it is only possible to forecast what
will happen over time. What occurs on any draw is unpredictable;
otherwise Las Vegas would be a barren space in the desert and
statistics a nonexistent field (shame on you if you cheered).

All of the above discussion naturally leads to our next topic. 

What is a Sampling Distribution?

We have made it clear that a given statistic computed from a single
sample may not closely represent the true parameter of the population.
But what would happen in the long run? That is, what if, instead of
stopping with one sample, we computed the same statistic for many? Let's
take the case of means as an illustration. Here is the situation: We have
a population of 15,000 physicians in a particular region of the country
and reason to believe that their salaries, whatever they may be, are
normally distributed. Since we don't have the time to obtain a salary
figure for all 15,000 individuals, we decide to use a random sample of
100 to help answer the pressing question, "How much do physicians
representing that population make?" Unbeknownst to us, somewhere in
the Internal Revenue Service computer are data that would indicate the
actual and true mean salary for that group to be $80,000 (enough to
make one think about med school). But not having this information,
we must go ahead and compute the mean for our sample. Will it be
exactly $80,000? Very unlikely, as samples fluctuate. In this case,
we happened to get more than the usual share of "Fat Cat" surgeons,
giving us a sample mean of $90,000. We have overestimated,
unintentionally of course. Suppose we had an infinite amount of time on our
hands and threw the first 100 back into the pool and then drew another
random sample, and then another, and then another, etc., etc. What
mean values would we get? All different ones, of course; some would
be higher than the parameter ($80,000), some would be lower, and still
others would hit it just about on the nose. The distribution of these
mean values would comprise what is referred to as a Sampling
Distribution. Here's a definition:

Sampling Distribution: A Sampling Distribution of a particular
statistic (e.g., the mean) refers to the probability distribution of
that statistic (e.g., means) for all possible samples of a given size
from some population.
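
Nobody has an infinite amount of time, but a computer can stand in for a good deal of it. The sketch below approximates the Sampling Distribution of the mean by drawing 2,000 random samples of 100 rather than all possible samples. The 15,000 salaries are generated at random, and the $20,000 standard deviation is purely an assumption, since the text states only the $80,000 mean.

```python
import random
import statistics

rng = random.Random(2024)  # fixed seed so the run is repeatable

# Hypothetical population: 15,000 salaries, roughly normal around $80,000.
population = [rng.gauss(80_000, 20_000) for _ in range(15_000)]
population_mean = statistics.mean(population)

def sample_mean(n):
    """Mean of one random sample of size n drawn from the population."""
    return statistics.mean(rng.sample(population, n))

# Throw each sample back and draw again, 2,000 times over.
sample_means = [sample_mean(100) for _ in range(2_000)]

mean_of_means = statistics.mean(sample_means)
spread_of_means = statistics.pstdev(sample_means)

print(f"population mean:        {population_mean:,.0f}")
print(f"mean of sample means:   {mean_of_means:,.0f}")
print(f"spread of sample means: {spread_of_means:,.0f}")
print(f"lowest and highest:     {min(sample_means):,.0f} to {max(sample_means):,.0f}")
```

Some sample means come out above the parameter and some below, but they pile up around it, and they vary far less than individual salaries do.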

Quite a mouthful, but the key to understanding here is not to try
to memorize what is said, but rather to sit back and see if you can
grasp its meaning. We have, thus far, established the following:

1. Random samples are used to derive statistics
   (means, correlations, etc.).

2. The statistics provide a basis for estimating
   population parameters (means, correlations, etc.).

3. Due to chance factors, e.g., who is selected in
   the sample, samples fluctuate.






4. As a result of sampling fluctuation, the
   statistic computed from one sample will
   probably differ to some extent from the
   same statistic, computed in the same way,
   from another sample.

5. The distribution of these differing
   statistical values represents what is called a
   Sampling Distribution. It tells you what
   possible values of the statistic can be
   obtained, with what frequencies, etc.

The notion of a Sampling Distribution suggests an organization
of scores that take on different values. This was the very idea with
which we were concerned in Unit I, when we organized scores in terms
of frequency distributions and frequency polygons. It might be
worthwhile to relate this prior learning to the present topic by
distinguishing between "ordinary, run-of-the-mill" frequency
distributions and this "imposing" new concept of Sampling Distributions.



Here we go:

                 Frequency Distribution         Sampling Distribution
                 (Unit I)                       (Unit VII)

Purpose          To portray the numerical       To portray the expected
                 values of a set of             numerical values of a
                 scores (X)                     statistic (e.g., X̄'s,
                                                r's, Md's, etc.)

Source           A single sample or             All possible unique
                 population                     samples of a certain
                                                size from a population

Horizontal       Score values (X's)             Values of a statistic
Axis                                            (X̄'s or others)

Vertical         Frequency of scores            Proportion of values
Axis             (f's)                          (p)

Central          The mean (X̄ or μ)              Expected Value represents
Tendency         indicates the average          the average value of the
                 score, interpreted as          statistic; i.e., the
                 a "typical" value              "typical" estimate given
                                                by the random samples

Variability      The variance (s² or σ²)        The Standard Error
                 or standard deviation          indicates the spread of
                 indicates the "spread"         the values of the
                 of scores                      statistic

Description      Empirical (obtained)           Theoretical probability
                 set of data                    model

Thus, a frequency distribution tells you what happened in terms of
individuals' scores on a particular measure: How many obtained
which scores? What was the most common score? What was the average?
etc.

A Sampling Distribution tells you what values are likely if a
certain statistic is computed from random samples selected from
some population: What value of the statistic is most probable?
What is the spread of the different values? etc.

As you're reading along, perhaps you're getting worried about the
idea of Sampling Distributions representing possible statistical
values for all unique samples. "Will I have to construct one of these
things?" you wonder. "That could take some time, and this course (as
much as I'm enjoying it!!) isn't worth that amount of punishment for
any grade!" you mutter (angrily). All of a sudden, dust appears on
the horizon and hoofbeats are heard... STATISTICAL THEORY to the
rescue!! Sampling Distributions do not have to be constructed piece
by piece; they are mathematically derived. More on that in the next
section.

Final Note: Sampling Distributions must always be based on
samples of a given size (whether that size be 3, 5, 23, 56, 89, 700,
etc.), and the samples must always be selected at random from the
same population.



7.2 POSTTEST (answers in back)

1. In your own words, explain the difference between statistics and
   parameters. From what source is each derived? Which is used as
   the estimate for the other? How are they symbolized?

2. What is meant by sampling fluctuation? How does this concept relate
   to the idea that samples allow for probabilistic rather than certain
   estimations?

3. In general terms, what does a Sampling Distribution represent?
   What are the major factors that determine its properties?






7.3 Instructional Unit: The Notion of Bias in Statistical Inference

Samples fluctuate. If two or more samples, identical in size
and selected from the same population, are used to derive the same
statistic, the values obtained in each case will vary. The Sampling
Distribution for that statistic represents the probability of
obtaining each of the different values. But as it turns out, some statistics
give "better" estimates of their corresponding parameters than do
others. Actually, there are two groups: the unbiased and the biased.

Unbiased: Unbiased statistics, when computed from random samples,
    give values that differ from the population parameter only on
    the basis of chance (who was included in the sample, testing
    conditions, etc.). If you take sample after sample and compute
    that statistic for each, the individual values will tend to
    differ from one another (remember sampling fluctuation?) but
    will cluster around the population (the parameter) value.
    Examples of unbiased statistics are the mean, the median, and
    the variance (computed by formula). If a statistic is
    unbiased, it is valuable to use as a basis for making an
    inference about the population.

Biased: Biased statistics, when computed from random samples, give
    values that systematically differ from the population parameter.
    If you take sample after sample and compute that statistic for
    each, the individual values will not only differ from each
    other, due to sampling fluctuation, but will cluster around
    a value that is different from the population value. Examples
    of biased statistics are the range and the variance (if
    computed for a sample using the formula). If a statistic is
    biased, using it to make an inference about the population can
    be quite misleading.
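
The difference between the two groups shows up clearly when the variance is computed two ways over many samples. The sketch below assumes that the two variance formulas in question are the standard pair: dividing the sum of squared deviations by n, and dividing it by n - 1. That reading is an assumption, as are the randomly generated population of 10,000 scores and the sample size of 5. Python's `statistics.pvariance` divides by n and `statistics.variance` divides by n - 1.

```python
import random
import statistics

rng = random.Random(11)  # fixed seed so the run is repeatable

# Hypothetical population of 10,000 scores.
population = [rng.gauss(100, 15) for _ in range(10_000)]
population_variance = statistics.pvariance(population)

divide_by_n = []          # sum of squares / n
divide_by_n_minus_1 = []  # sum of squares / (n - 1)
for _ in range(5_000):
    sample = rng.sample(population, 5)
    divide_by_n.append(statistics.pvariance(sample))
    divide_by_n_minus_1.append(statistics.variance(sample))

avg_n = statistics.mean(divide_by_n)
avg_n_minus_1 = statistics.mean(divide_by_n_minus_1)

print(f"population variance:      {population_variance:.1f}")
print(f"average of n version:     {avg_n:.1f}")
print(f"average of n - 1 version: {avg_n_minus_1:.1f}")
```

Individual samples miss in both directions either way, but the n - 1 values cluster around the population variance while the n values cluster around a figure that is too small.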

* * * * * * * * * * * * *

The Great Living Room Squirtout

or
How to Cross-Examine a Wetness

Pretend there is a watergun (we're intentionally making this
example nonviolent) that is positioned 10 feet away from a target. The
watergun is perfectly aimed at that target, and to maintain that
positioning, is locked in a vise so that it cannot be moved.

You shoot the gun. Will the water hit the bullseye? Given that
it is perfectly aimed at the bullseye, that is the most probable
destination. In fact, though, many chance factors could prevent the shot
from being accurate: a north wind this time, a south wind the next,
a little too heavy on the trigger this time, a little too light the
next, "heavier" water this time, maybe lighter water the next. If
conditions are perfect, we're going to hit the target! If they're not,





Notice that the bullseye was frequently missed, and some of the misses
were rather severe at that. But, if you had to guess where the 21st
shot would land, the bullseye would still be the best bet. Chance
factors (like wind) resulted in some error, but the shots nonetheless cluster
around the bullseye, since that's where the gun is pointed.

See the relationship between these shots and computing unbiased
statistics from random samples?



Shots #21-40

The next day, your clumsy cousin Ferdie stumbles on the vise and
moves it! Now the water gun's aim is positioned 1 inch to the right of
the target! You shoot the gun. Will the water hit the bullseye? It can,
but that's no longer the most probable spot. If conditions are absolutely
perfect, the water's going to land (you got it!) exactly 1 inch to the
right of the bullseye. But if a strong east wind blows, we might get the
bullseye that way. Here's what happened in shots 21-40.






Notice that the bullseye was almost always missed, and most of the shots
are to the right of it. No surprise! The gun was (mistakenly) aimed to
the right, and the shots will cluster around the aiming point. As far
as the bullseye is concerned, our aim is biased.

* * * * * * * * * * * *

Unbiased statistics, like the mean, are analogous to the first part
of the above illustration. If your sample is perfectly representative
of the population, and nothing goes wrong in the testing, the sample
mean should be exactly equal to the population mean (bullseye!!). But
how many times are you going to get a perfectly representative sample?
Random samples help to produce representativeness but by no means
guarantee it. So, many times, you'll miss the target, but your shots (sample
statistics) will cluster around the bullseye (population parameter).

Biased statistics, like the range, are analogous to the second
part of the illustration. Due to the way they're computed, the aim is
all wrong from the very beginning. The expected value is not equal to
the population value. Using the sample range to estimate the population
range is like trying to hit the bullseye (see illustration) with your
watergun pointed in the wrong direction. All you can do is (s)pray for
luck. But the act itself is inadvisable. Here's an example:

Example

Suppose that for the entire population of employees at General
Mittens Co. the average number of cups of coffee consumed each day (μ)
equals 2.3 and the range = 20. How do we know this? In real life we wouldn't,
but being allowed to make up hypothetical situations for instructional
purposes gives one a great deal of power. Remember that the population
mean and range would be derived based on all persons in the group.
Range = 20 would be derived by subtracting the lowest value for the
population (most likely, many people drink zero cups) from the highest
value (in this case, Roderick Di Beaneo, a resident of Seattle, reported
drinking 20 cups a day, the highest in the entire population). Suppose
we take a sample of 20 workers: will the sample mean, X̄, be identical
to the real population mean, μ = 2.3? No, unless we are extremely
lucky. But, since we sampled randomly, we should be able to regard the
statistic, X̄, as a fair estimate. It could be too large or it could
be too small, but given the way in which the mean is computed, no
systematic overestimation or underestimation should have taken place. Any
variation from the population mean should be due entirely to chance.
If additional random samples were employed, some should yield X̄'s that
are higher and some X̄'s that are lower. Over all possible samples, however,
the lows should balance out the highs, with the central tendency of the
Sampling Distribution represented by the population value of 2.3. The larger
the deviation from this central value, in either the positive or negative
direction, the lower the probability of our obtaining a sample mean that
size. What we are saying is that X̄ provides an unbiased estimate of μ.
You may "miss" on any given try, but the long run projection is one of
favoring X̄'s that approximate μ.

What about the range in our sample? The range is a biased statistic,
as its deviation from the true population value takes place systematically
rather than strictly by chance. Specifically, the sample range
systematically underestimates the population range; after repeated sampling,
you will get sample statistics that cluster around a value that is less
than the true range of the population. Can you figure out why? Unlike
the mean, the range is based on two scores only: the two most extreme.
You may recall that in the whole population, the highest score was Rod
Di Beaneo's 20 cups per day. What is the only way that the sample range
can equal the population range? Rod Di Beaneo must be one of those
selected!! If he is not, the sample range has to be smaller. If we
don't get Rod Di Beaneo, we still have a decent chance of selecting
some reasonably high score (Tom Perky, for example, who drinks 18 cups
a day). But, can you appreciate what could happen if the population
range were to be estimated from a very small sample? The low score might
be something like 2 cups a day and the high score only 4 cups. The sample
range would be only 2, a very substantial deviation from the parameter
value of 20. Note, however, that the mean for this very small sample
could still provide a close approximation of μ = 2.3.
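
If you would like to watch this happen rather than take it on faith, here
is a minimal Python sketch. The population of 1,000 employees and its cup
counts are invented for illustration (chosen only so that μ = 2.3 and the
range = 20, as in the example); the function name, the sample sizes, and
the 5,000 repetitions are likewise assumptions, not anything from the text.

```python
import random
import statistics

# Hypothetical population of 1,000 employees: {cups per day: number of people}.
# The counts are invented so that the mean is 2.3 and the range is 20.
CUP_COUNTS = {0: 225, 1: 225, 2: 200, 3: 130, 4: 90, 5: 50, 6: 35,
              8: 20, 10: 12, 12: 6, 15: 5, 18: 1, 20: 1}
population = [cups for cups, people in CUP_COUNTS.items() for _ in range(people)]

def long_run_averages(sample_size, n_samples=5000, seed=1):
    """Draw many random samples; return (average sample mean, average sample range)."""
    rng = random.Random(seed)
    means, ranges = [], []
    for _ in range(n_samples):
        sample = rng.sample(population, sample_size)  # sampling without replacement
        means.append(sum(sample) / sample_size)
        ranges.append(max(sample) - min(sample))
    return sum(means) / n_samples, sum(ranges) / n_samples

pop_mean = statistics.mean(population)
pop_range = max(population) - min(population)
results = {n: long_run_averages(n) for n in (3, 20)}

print(f"population: mean = {pop_mean}, range = {pop_range}")
for n, (avg_mean, avg_range) in results.items():
    print(f"n = {n:2d}: average mean = {avg_mean:.2f}, average range = {avg_range:.2f}")
```

The average sample mean should land close to 2.3 for both sample sizes,
while the average sample range should fall well short of 20, and shortest
of all for the very small samples.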

Let's turn to some other statistics and ask the same question: 
Which provide unbiased estimates of population parameters and which 
do not? 

1. Mean (X̄): Means will differ from sample to sample. But the
   preponderance of sample means should cluster around the true
   population value, μ, whatever it may be. The further the
   distance of X̄ from the parameter, the fewer the number of
   samples yielding those X̄'s. The mean, as we have discussed,
   is an unbiased statistic.

2. Range: The ranges too will differ from sample to sample, but
   due to the way in which ranges are computed, they will cluster
   around a value that is smaller than the actual population
   range. The largest sample range can only be as great as the
   parameter; all other sample ranges will be smaller, due to the
   failure of the samples to include the highest and lowest
   (most extreme) scores in the population. The range is a
   biased statistic.

3. Variance: You may recall that the variance can be computed
   in two different ways: with n in the denominator (symbolized
   σ²), or with n - 1 in the denominator (symbolized s²).
   It was discussed that the latter procedure is used for
   making inferences about populations from samples. Now you
   should get a better idea of why such a distinction is
   necessary.

   If you use the formula Σx²/n (see p. 82) to compute the
   sample variance, you will get a biased estimate of the
   population variance, σ². Actually, for reasons we won't go
   into at this point, we will tend to get an underestimate of
   σ².

   If we use the formula Σx²/(n - 1) (see p. 86) to compute the
   sample variance, we get an unbiased estimate of σ². This
   formula, if you recall, gives a statistic called s² (see p.
   96). s² is an unbiased estimate of σ².

4. Median (Md): The median, like the mean, is an unbiased
   statistic. If computed from different samples, the values
   obtained would tend to cluster around the actual population
   Md. What is interesting, however, is that the sample Md's
   would show greater spread (greater variability) than would
   the sample X̄'s. Thus, while both statistics tend, over
   repeated sampling, to approach the population parameter, the
   chances of any single sample providing an exact (or close)
   estimation is greater for X̄ than for Md. Remember when we
   discussed that the mean is usually the preferred measure of
   central tendency? One of the reasons is that it is a more
   precise estimator.

5. Correlation (r): Two variables are needed for correlational
   analysis, so let's assume that chemistry achievement scores
   for all high school students are correlated with their
   junior-year grade point averages (GPA). The question is, will the
   computed sample r's represent unbiased estimators of the correlation
   coefficient for the population (ρ)? Unfortunately for those
   seeking the most straightforward reply, the answer is "maybe,
   maybe not." For r, the rule is as follows: if the population
   ρ = 0, r does represent an unbiased statistic. That is, the
   sample r's will tend to distribute around a central value of
   0.00. If the population correlation differs from 0.00 (either
   + or -), r becomes a biased statistic, for which the tendency
   is to underestimate the absolute value of the parameter.
   In conclusion, the Sampling Distribution of r may or may not
   reflect bias in estimating the population parameter, depending
   upon whether the two variables are correlated (bias) or not
   correlated (no bias) in the population.
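
The claim in item 3 can be checked exactly on a tiny case. The sketch
below is only an illustration under stated assumptions: the "population"
is the six faces of a die (an invented example, not one from the text),
samples of n = 3 are drawn with replacement, and every possible sample is
listed so that no luck is involved.

```python
from itertools import product
from statistics import mean, pvariance

population = [1, 2, 3, 4, 5, 6]   # hypothetical population: the faces of a die
n = 3                             # sample size (an arbitrary choice)

def sum_of_squares(sample):
    """Sum of squared deviations of the scores from their own sample mean."""
    m = sum(sample) / len(sample)
    return sum((x - m) ** 2 for x in sample)

# Every possible sample of size n, drawn with replacement (6**3 = 216 samples).
samples = list(product(population, repeat=n))

avg_n_formula = mean([sum_of_squares(s) / n for s in samples])
avg_n_minus_1_formula = mean([sum_of_squares(s) / (n - 1) for s in samples])
sigma_squared = pvariance(population)   # the parameter being estimated

print(f"population variance          = {sigma_squared:.4f}")
print(f"average of the n formula     = {avg_n_formula:.4f}")
print(f"average of the n - 1 formula = {avg_n_minus_1_formula:.4f}")
```

Averaged over all possible samples, the n - 1 formula should match the
population variance, while the n formula should come out too small (by a
factor of (n - 1)/n).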







Optional explanation for those interested: Why would a sample r
tend to underestimate the population correlation? Suppose the
correlation between two variables is +.85 in the population. If you
try to estimate this value by use of a sample, the concept of sampling
fluctuation suggests that in a single draw, anything is possible. The
greatest probability, of course, is that you will obtain a positive
correlation that is at least close to the actual population coefficient
of .85. But given that correlations can range from -1.00 to +1.00,
there is a much greater area below +.85 than there is above +.85.
Look below:

   |--------------------|----------------|--|
 -1.00                 .00              .85 +1.00

Thus, following repeated sampling, you will end up with many values
clustered around the +.85, but some will (by chance) fall in the
rather extreme areas below .85. These extreme values will tend to
pull the Expected Value (i.e., the average sample r) down, and thus
yield a Sampling Distribution that systematically underestimates the
parameter.
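
For the curious, the pull-down effect can be seen in a rough simulation.
This is a sketch under assumptions of our own: both variables are taken to
be normally distributed, each sample has n = 8 pairs, and 20,000 samples
are drawn. None of those choices comes from the text.

```python
import math
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length lists of scores."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def average_r(rho, n=8, n_samples=20000, seed=7):
    """Average sample r when the population correlation is rho."""
    rng = random.Random(seed)
    noise = math.sqrt(1 - rho ** 2)
    total = 0.0
    for _ in range(n_samples):
        xs = [rng.gauss(0, 1) for _ in range(n)]
        # y = rho*x + noise gives a population correlation of exactly rho
        ys = [rho * x + noise * rng.gauss(0, 1) for x in xs]
        total += pearson_r(xs, ys)
    return total / n_samples

avg_r_correlated = average_r(0.85)
avg_r_uncorrelated = average_r(0.0)
print(f"population rho = .85: average sample r = {avg_r_correlated:.3f}")
print(f"population rho = .00: average sample r = {avg_r_uncorrelated:.3f}")
```

With ρ = .85 the average sample r should settle a little below .85; with
ρ = 0 it should settle very near zero.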

* * * * * * * * * * * * * * ** * 

Final note before practice post test: If sample values tend to
cluster around the population parameter, we identified the statistic
as unbiased (e.g., X̄); if they tended to yield an average equal to
some other value, we identified the statistic as biased (e.g., range).
Sampling Distributions are theoretical models which are based on what
is mathematically determined to occur if a statistic is computed from
all unique samples that can be generated from a population.



7.3 POSTTEST (answers in back)

1. Assuming that the population in question is normally distributed,
   which of the following statistics would comprise unbiased
   estimators of their corresponding parameters? For any that you
   characterize as biased, specify the direction that the bias will
   take (overestimate or underestimate).

   a. Md

   b. variance using Σx²/(n - 1)

   c. variance using Σx²/n

   d. X̄

   e. r (ρ = 0)

   f. r (ρ ≠ 0)

2. What is the relation between sample size (n) and the accuracy of
   estimation?



7.4 Instructional Unit: Sampling Distributions of Means



Given that the mean is probably the most commonly used statistic
for inference-making, it seems important to describe the properties
of its Sampling Distribution in a little more detail. First, what
might a Sampling Distribution look like? At first glance, it can
easily be mistaken for a Frequency Polygon (Unit I). Actually it's
quite similar, but while the Frequency Polygon tells you the
frequency with which different score values (X's) were obtained from all
people, the Sampling Distribution tells you the frequency (or
probability) with which different means (X̄'s) were obtained from all unique
samples.





[Figure: a SAMPLING DISTRIBUTION (horizontal axis: X̄) shown beside a
regular "OLD" FREQUENCY DISTRIBUTION]

Question: What three properties are important in describing 
distributions of scores? 

Answer: (from Units I, II, and III) Form, central tendency, and
variability.

Just as we dealt with these properties in discussing frequency
distributions, Sampling Distributions have form, central tendency,
and variability.






How are the values distributed when computed for all unique samples
of identical size selected from the same population? Let's deal
with form first, assuming random sampling in all examples.



Form 



If the population from which sampling takes place is normally
distributed, the Sampling Distribution of means will also be normally
distributed. Assume that height is normally distributed for a certain
population (e.g., all females in Omaha). If means (X̄'s) are computed
for all unique samples of a certain size selected from that population,
the values of those means will be normally distributed!

HOWEVER, if the population scores are not normally distributed,
the form of the Sampling Distribution of means will approach normality
as sample size increases. What this suggests is that, regardless of
the form of the population (rectangular, positively skewed, etc.), we
will end up with a Sampling Distribution resembling the normal curve
if sample size (n) is large enough. For instance, suppose that the
distribution of women's heights in Omaha is positively skewed. An
infinite number of very small samples (e.g., all n's = 2) may yield X̄'s
that do not distribute normally. An infinite number of large samples
(e.g., all n's = 30) would yield X̄'s that tend to distribute normally.
(The advantage of a "normal" Sampling Distribution is that it can be
analyzed via the "normal curve probability estimates" with which we
dealt in earlier sections; more on this later.)
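
A quick way to see the form "straighten out" is to measure how lopsided
the sample means are. The sketch below is an illustration only: the
positively skewed population is an exponential distribution standing in
for the skewed heights (our assumption), 20,000 samples stand in for the
"infinite number," and the skewness index used (the average cubed z score)
is one common choice rather than anything defined in this book. A value
near 0 indicates a symmetrical distribution.

```python
import random
from statistics import fmean, pstdev

def skewness(values):
    """Average cubed z score: about 0 if symmetrical, positive if the tail is to the right."""
    m, sd = fmean(values), pstdev(values)
    return fmean([((v - m) / sd) ** 3 for v in values])

def sample_means(n, n_samples=20000, seed=3):
    """Means of many random samples of size n from a positively skewed population."""
    rng = random.Random(seed)
    # expovariate(1.0) draws from an exponential distribution (strong positive skew)
    return [fmean([rng.expovariate(1.0) for _ in range(n)]) for _ in range(n_samples)]

skew_by_n = {n: skewness(sample_means(n)) for n in (2, 30)}
for n, skew in skew_by_n.items():
    print(f"n = {n:2d}: skewness of the sample means = {skew:.2f}")
```

The means of the 2-case samples should still be plainly skewed, while the
means of the 30-case samples should be much closer to symmetrical.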

How big must n be to ensure normality? You do not have to memorize
these, but please read them; they provide valuable guidelines for making
decisions about sample size.

A. If your population data are normally distributed,
   a sample size of only 4 (n = 4) is sufficient to
   ensure normality in the Sampling Distribution.

B. If your distribution is not normally distributed
   but is symmetrical, samples of 10 to 15 cases may
   be needed.

C. If your distribution is slightly skewed, you may
   still need at least 10 cases.

D. If the population is moderately skewed, 20 cases may
   be called for.

Example Problem (just for fun):

A Sampling Distribution of means is based on samples of 5 cases
selected from a population that is moderately skewed in form. Would
it be correct to assume that the Sampling Distribution is normally
distributed?

Answer: No. For a moderately skewed population, samples of
size 20 (see "D") are generally needed to ensure normality
in the Sampling Distribution.

Suppose the population data were normally distributed. What then?

Answer: Samples of n = 5 would be adequate to ensure normality
(see "A").

Central Tendency 

In discussing the concept of bias, we said that if the typical
sample statistic was expected to be equivalent to the population
parameter, the estimator was identified as being unbiased (e.g., s², Md,
and, of course, X̄). The mean value of a Sampling Distribution is called
its Expected Value, and symbolized as E( ). Inserted in the
parentheses is the symbol for the particular statistic being characterized;
thus, we can have E(Md), E(range), etc.

In the case of means, the properties of E(X̄) can be very easily
identified by the following rule: E(X̄) = μ. What this means is that
if X̄'s are computed for an infinite number of random samples, the mean
of all the X̄'s (i.e., the average X̄) will be equal to the population
mean (μ).

Example:

Suppose that for all bluejays who nest in a particular park, the mean
length of individuals is 9 inches. All unique random samples of 25
bluejays are identified and an X̄ is computed for each. The X̄'s will
naturally differ from one another, but if all are averaged together,
what value will be obtained? The answer: E(X̄) = 9, the population
mean! Suppose that X̄'s are computed for all unique samples of 10
individuals. What will these X̄'s average? The answer is the same:
E(X̄) = 9. Does the form of the distribution matter? It does not; in
the case of means, E(X̄) = μ regardless of form or of sample size.
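
With a small enough population you can actually list every unique sample
and check the rule directly. In the sketch below, the eight bluejay
lengths are invented (a deliberately lopsided set whose mean is 9 inches),
and the sample sizes are arbitrary; only the rule being checked comes from
the text.

```python
from itertools import combinations
from statistics import mean

# Hypothetical, positively skewed population of 8 bluejay lengths (inches).
lengths = [8, 8, 8, 8.5, 8.5, 9, 10, 12]
mu = mean(lengths)   # population mean

expected_values = {}
for n in (2, 4, 5):
    # One X-bar for every unique sample of n birds
    xbars = [mean(sample) for sample in combinations(lengths, n)]
    expected_values[n] = mean(xbars)   # the mean of the means, E(X-bar)
    print(f"n = {n}: {len(xbars)} unique samples, E(X-bar) = {expected_values[n]:.4f}")

print(f"population mean = {mu}")
```

Whatever the sample size, the mean of all the sample means should come out
equal to the population mean.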

Here is the rule again: When means are computed for an infinite
number of random samples (of a certain size), the mean of the means
(i.e., E(X̄)) will be equal to the mean of the entire population.
Understand what this means? Try your skill on this one:

The average weight of adults in a particular town is 135 lbs. If
X̄'s are computed for all unique samples of 25 people from that town,
what will E(X̄) equal?

Answer: E(X̄) = 135; the mean of all the means will be 135.

Suppose the population was positively skewed, would the E(X̄) change?
No! Suppose the sample size were reduced to n = 4; would that have
an effect? Well, it might reduce our chances of obtaining a normally
distributed Sampling Distribution, depending on the form of the
population (see previous section), but central tendency would conform
to the same rule: E(X̄) = μ.

Here are the symbols again for a final view before we turn to the
final Sampling Distribution characteristic, variability.

X̄ : the mean of an individual sample selected from a
population

μ : the mean of the population

E(X̄) : the mean of a Sampling Distribution of X̄'s
for that population



Variability 

Thinking way back, you may recall that Unit II (Central Tendency)
was followed as closely as was possible by a unit on the subject of
variability (Unit III). Does Unit III apply to what we're doing here?
There is no question that it does; anytime we have a distribution of
scores, whether it consists of raw scores or sample means, it helps to
give a characterization of form, central tendency, and variability.
Having already covered the first two concepts in reference to Sampling
Distributions of Means, we are now ready for the third. What about
variability?

The variance and the standard deviation are the most respected and
revered indices of variability. Theoretically, we can compute them for
Sampling Distributions using the same formulas employed for frequency
distributions in Unit III.

The standard deviation of a Sampling Distribution, like the
Expected Value, is so important that it is given a special name: the
Standard Error. The Standard Error is simply the standard deviation of
all values in the Sampling Distribution (keep in mind that these values
are all sample means). To avoid confusion with other types of standard
deviations, the Standard Error is symbolized as σ_X̄; this symbol suggests
that we are dealing with the standard deviation (σ) of sample means (X̄).

Let's stop for a moment and get our symbols straight.
Question: What do the following symbols mean: σ, s, and σ_X̄?

Answer: σ refers to the standard deviation of all raw scores
in the population.

s refers to the standard deviation of all raw scores
in an individual sample from the population.

σ_X̄ refers to the standard deviation (called Standard Error)
of all scores (sample means, X̄'s) in the Sampling
Distribution.



What does the Standard Error tell us? It indicates the variability
in the Sampling Distribution. It is particularly valuable in allowing
us to compare the variability (or sampling fluctuation) in two or more
different Sampling Distributions.

Properties of the Standard Error

Suppose we have a population of 100 test scores with μ = 57.
We randomly select an infinite number of samples of size 5 and compute
their means. The result is a Sampling Distribution of means with an
Expected Value, E(X̄), of 57. Right?

Use your imagination, and pretend that it would be practical
to compute the Standard Error by laboriously subtracting E(X̄) from
each of the infinite number of sample means, squaring the resultant
deviation scores, averaging them, and taking the square root of the
result. In practice, this would be absurd, but let's say that we
were committed to the task, and that the result was equal to 3.00.
Now we have a Sampling Distribution that we can characterize quite well.

E(X̄) = 57



If you remember the material on z scores, you realize that given the
mean and standard deviation of any normal distribution, we can compute
the percentage of scores that fall between any two z scores, raw scores,
and so on. Knowing the Expected Value and Standard Error of a Sampling
Distribution can be very valuable. We'll save this for later, however.

For now, simply consider the above results and think about the
following situation. Suppose we went back to the same population of
100 test scores, but this time selected at random an infinite number
of samples of size 10. We end up with a Sampling Distribution of means
based on 10, not 5, cases.

Question: What would be the Expected Value of this new
Sampling Distribution?

Answer: E(X̄) = 57, of course! True, we changed sample size,
but the basic principle is still in effect: E(X̄) = μ.

The following question, which is similar to one posed in Unit 6.2, is
really the "meat" of this section. Before looking at the answer, use
your powers of intuition and pure logic and give it a try. If you get
the right answer (for the right reasons), the remainder of this unit
will be smooth sailing. If you cannot get the answer, give the
accompanying explanation your full attention.

THE QUESTION: In the first example, we pretended that the Standard
Error of the Sampling Distribution for 5-case samples was 3.
Would the Standard Error of the new Sampling Distribution,
based on 10 cases, be equal to, greater than, or less than 3?
In other words, would the variability between sample means be
different when all samples are made up of 5 cases as opposed
to 10 cases?




THE ANSWER: The variability (Standard Error) of the 10-case Sampling
Distribution would be less than that of the 5-case distribution.
Why? The 10-case sample means are likely to fluctuate (from μ)
less than the 5-case sample means simply because more scores are
being taken into account in each case.

NEW QUESTION: What is the most accurate way of determining the mean
of a population of 1,000 scores?

ANSWER: Take all of the 1,000 scores and average them!!

NEWER QUESTION: What would be the second most accurate way?

ANSWER: Take a sample of 999 scores and average them!!

NEWEST QUESTION: The third most reliable estimate?

ANSWER: Take a sample of 998 scores and average them.

The larger the samples, the closer each mean will probably be to the
population mean and to each other!!

For those serious students who are still plagued by doubts and
uncertainties, more explanation can be found below.

*** ******* 

Pretend that from our population of 100 scores with μ = 57, we
construct a Sampling Distribution based on samples of size 99. The
Sampling Distribution, by definition, would show the proportion of the
sample means of different values that we obtained. Understanding that
each sample mean is based on 99/100 scores from the population, would
there be much variability? Would there be a large Standard Error?

NO! Each sample mean would be extremely close to μ. After all,
each is based on every score from the population save one. Thus, as
we draw sample after sample, and compute X̄ for each, the resultant
values would probably be ones like 57.05, 56.98, 57.01, 57.23, 56.89,
etc. See what's happening? With only one member of the population
missing in our samples of n = 99, each sample mean would almost be
identical to μ = 57. Variability, or σ_X̄, of the Sampling Distribution
would, therefore, be very small. The Sampling Distribution, when
plotted, would have, of course, an Expected Value equal to 57, and look
something like this:




[Figure: Sampling Distribution of means for n = 99, centered on 57.
Note: Nearly the entire distribution is between 56 and 58. Small
variation!!]



On the other hand, pretend that from our population of 100 scores
with μ = 57, we construct a Sampling Distribution of means based on
2-case samples. Understanding that each sample mean is based on only
2/100 scores from the population, would there be much variability?
Would there be a large Standard Error?

Yes! Yes! Using only two cases, some of the sample means would
be extremely high, and some of the sample means would be extremely low.
Thus, as we draw sample after sample, and compute X̄ for each, we could
get fairly discrepant values such as 42.06, 65.28, 58.01, 39.86, 49.37,
etc. With only 2 people being considered in each sample, the chances
for variation from sample to sample would be relatively large.
Therefore, variability, or σ_X̄, of the Sampling Distribution would also be
large.

The Sampling Distribution, when plotted, would have an Expected
Value equal to 57, and look something like this:




ANOTHER BASIC LAW TO REMEMBER: As sample size increases, the Standard
Error of the Sampling Distribution decreases. The Expected Value,
however, is unaffected by sample size.
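
The law can be tried out by brute force. What follows is a sketch under
our own assumptions: the 100 test scores are invented (built only so that
their mean is exactly 57), each sample is drawn without replacement (as
the 99-out-of-100 example implies), and 5,000 samples per sample size
stand in for the "infinite number."

```python
import random
from statistics import fmean, pstdev

# Hypothetical population of 100 test scores: 32 to 82 in half-point steps,
# skipping 57 itself, so that the population mean is exactly 57.
scores = [57 + 0.5 * d for d in range(-50, 51) if d != 0]

def simulate(n, n_samples=5000, seed=11):
    """Return (mean of the sample means, standard deviation of the sample means)."""
    rng = random.Random(seed)
    means = [fmean(rng.sample(scores, n)) for _ in range(n_samples)]
    return fmean(means), pstdev(means)

results = {n: simulate(n) for n in (2, 5, 10, 99)}
for n, (expected_value, standard_error) in results.items():
    print(f"n = {n:2d}: E(X-bar) is about {expected_value:.2f}, "
          f"Standard Error is about {standard_error:.2f}")
```

The Expected Value should stay close to 57 in every row, while the
Standard Error should shrink steadily as n grows, becoming tiny for the
99-case samples.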



Computing the Standard Error 

We have thus far been able to define with pinpoint accuracy one of
a Sampling Distribution's most important properties: E(X̄) = μ.

Likewise, we can determine the Standard Error of a Sampling
Distribution through a mathematical derivation. Through genius, patience,
and sophisticated mathematical procedures, statisticians have come up
with the following formula:

$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$$

In a sentence, this equation means: the Standard Error of a
Sampling Distribution is equal to the population standard deviation
(σ) divided by the square root of the sample size (n).

Example Problem

Given a population of 1,000 scores with μ = 97 and σ = 20,
what are the Expected Value and the Standard Error of a Sampling
Distribution of means where sample size is n = 16?

The first question is easy: E(X̄) = μ; Expected Value = 97.

The second question is also easy:

$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{20}{\sqrt{16}} = \frac{20}{4} = 5$$

*Note: n = sample size, not population size.
Example Problem

From the same population, we construct a Sampling Distribution of
means where sample n = 4. What is the Expected Value? Standard Error?

Answer: E(X̄) again = 97.

Since sample size is less this time, we expect variability to
increase. Using the formula, this turns out to be the case.

$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{20}{\sqrt{4}} = \frac{20}{2} = 10$$




The formula for computing the Standard Error is entirely consistent
with the basic premise that suggests that as sample size increases,
variability of the Sampling Distribution will decrease. Can you see why
that is the case both logically and mathematically (or either)?
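
The two example problems above can be reproduced in a few lines, along
with a rough brute-force check of the formula. This is a sketch only: the
function name is our own, and the check assumes a normally distributed
population with μ = 97 and σ = 20 and uses 20,000 random samples of 16.

```python
import math
import random
from statistics import fmean, pstdev

def standard_error(sigma, n):
    """Standard Error of the mean: population standard deviation over sqrt(n)."""
    return sigma / math.sqrt(n)

se_16 = standard_error(20, 16)   # 20 / 4
se_4 = standard_error(20, 4)     # 20 / 2
print(f"n = 16: Standard Error = {se_16}")
print(f"n =  4: Standard Error = {se_4}")

# Brute-force check for n = 16: draw many samples, take each mean,
# then compute the standard deviation of those means.
rng = random.Random(5)
sample_means = [fmean([rng.gauss(97, 20) for _ in range(16)]) for _ in range(20000)]
simulated_se_16 = pstdev(sample_means)
print(f"n = 16: simulated Standard Error is about {simulated_se_16:.2f}")
```

Notice that cutting the sample size to one fourth (16 down to 4) doubles
the Standard Error, since n enters the formula through its square root.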



Final Notes



1. The Standard Error is the standard deviation of a Sampling
   Distribution. In the case of means, which we have been discussing, it is
   symbolized σ_X̄. This symbol very simply says that we are representing
   the standard deviation, σ, of sample means, X̄'s.

   If you like analogies: σ_X̄ is to a Sampling Distribution what σ
   is to a population of scores.

2. If you know σ for a population, you can determine the amount of
   variation between sample means for all unique samples based on the
   same number of cases.

3. Descriptive statistics (Units I-V) help us to characterize a given
   set of scores. Inferential statistics help us to make probability
   judgments about populations using data from samples. Sampling
   Distributions are probability models; they tell you what to expect
   in the long run if you rely on samples of a given size to estimate
   certain population parameters (means, standard deviations, etc.).
   In restricting our focus to sample means, we have found that with
   the availability of certain "givens," such probability models can
   be derived in terms of form, central tendency, and variability.
   As will be seen in the next section, these three characteristics
   are all we need to know to begin the inferential process.

7.4 POSTTEST (answers in back)

1. Describe the relationship between sample size (n) and the form of a
   Sampling Distribution of means.

2. How does form of the population influence the above relationship?

3. For each of the symbols in the lefthand column, match the appropriate
   description from those listed at the right.

   a. X̄          1.
   b. σ          2.
   c. σ_X̄        3.
   d. μ          4.
   e. s          5.
   f. E(X̄)       6. the standard deviation of a sample
                 7. the Expected Value of a Sampling Distribution
                 8. the variance of a sample

   [number illegible] the population mean

4. What is the relationship between μ and E(X̄)? How is this
   relationship influenced by n?

5. What is the relationship between σ and σ_X̄? How is this
   relationship influenced by n?

6. A researcher is concerned with a population of test scores where
   μ = 70 and σ = 20. Determine the Expected Value and Standard
   Error of Sampling Distributions of means where:

   a) the sample n = 4

   b) the sample n = 16

   c) the sample n = 64




7.5 Instructional Unit: Computing Probabilities for Obtaining
Sample Means



You are about to see how Sampling Distributions can actually be put
to use in answering certain types of questions. To get through this
section, and thereby finish this Unit, you will need to be comfortable
with the following:

1) the general concept of Sampling Distributions;

2) determining the Expected Value and Standard Error of a
   Sampling Distribution; and

3) the use of the normal probability model ("area under the normal
   curve" table) presented in the last few units of Unit IV.

Suppose that every fifth grader in Merryville, New Mexico (5,000 in
all) was administered the Auto Mechanics Aptitude Test. The mean score
for the population was found to be 50, and the standard deviation was
found to be 12. The distribution was also found to be normally
distributed.

Mr. Greaser, an auto mechanics teacher, asks himself the following
question: "If I were to take a random sample of 16 fifth graders and
give them the aptitude test, what would be the chances that the mean of
my sample would be greater than the population mean (μ = 50), but less
than 53?"






Given the above information, can Greaser's question be answered? 



Yes it can! Sampling Distributions show the probability of
obtaining sample means of different values. That statement is definitional.
If we know the properties of the Sampling Distribution that defines Mr.
Greaser's students, the question he poses can be answered with ease.

Can we characterize the appropriate Sampling Distribution?

It can be done very easily. The mean of the Sampling Distribution
of means would be equal to the population mean, 50.

E(X̄) = μ; E(X̄) = 50

The Standard Error of the Sampling Distribution can also be easily
derived:

$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{12}{\sqrt{16}} = \frac{12}{4} = 3$$

Since the population was normally distributed and the sample size (n = 16)
is fairly large, we know that the Sampling Distribution will resemble a
normal curve.

Given the Expected Value, Standard Error, and form of the Sampling
Distribution, we can now actually draw it. Using the "area under the
normal curve" table as we did in Unit IV, we can answer questions
regarding the probability of selecting samples (of size 16) with means of
different values.

[Figure: the normal Sampling Distribution, with the horizontal axis
labeled in both Sample Mean and z score units]

In order to determine probabilities, you will need to convert your
raw scores (actually, sample means) into z scores.

$$z = \frac{X - \bar{X}}{\sigma} \qquad \text{in this case:} \qquad z = \frac{\bar{X} - E(\bar{X})}{\sigma_{\bar{X}}}$$

for a sample mean of 50:

$$z = \frac{50 - 50}{3} = 0.00$$

for a sample mean of 53:

$$z = \frac{53 - 50}{3} = +1.00$$

Use the "area under the normal curve" table to find the distance 
between your Z scores. 

\ 

From the table we find that for Z = +1,00, we have 34,13% of the 
area between that value and the mean (z - 0).« 

Our Answer: If Mr. Greaser selects a random sample of 16 students
from the population, there is a 34.13% chance that the mean of
the sample will fall between 50 (the population mean) and 53.
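
For readers who would like to check the table lookup by machine, here is a minimal sketch in Python. It assumes the standard library's `NormalDist` class stands in for the "area under the normal curve" table; the function name `prob_mean_between` is made up for this illustration and is not part of the text's procedure.

```python
from math import sqrt
from statistics import NormalDist


def prob_mean_between(mu, sigma, n, low, high):
    """Probability that the mean of a random sample of size n falls between low and high."""
    standard_error = sigma / sqrt(n)                 # Standard Error = sigma / sqrt(n)
    sampling_dist = NormalDist(mu, standard_error)   # Sampling Distribution of means
    return sampling_dist.cdf(high) - sampling_dist.cdf(low)


# Mr. Greaser's question: population mean 50, sigma 12, samples of 16.
greaser = prob_mean_between(mu=50, sigma=12, n=16, low=50, high=53)
print(f"{greaser:.2%}")
```

The same function should handle any "between two values" question in this section once the population mean, the population standard deviation, and the sample size are supplied.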

All we are doing is using normal curve probabilities to determine
the likelihood of obtaining sample means of certain values. The procedure
is identical to that used in Unit IV, except, of course, that the
z scores which comprise the referencing points for the "normal curve"
table are now based on distances from $E(\bar{X})$ divided by $\sigma_{\bar{X}}$. Why these
and not our old friends $\bar{X}$ and $\sigma$ from Unit IV? Shame on you if you
asked: We are dealing with a Sampling Distribution of means, and these
elements are what define the Sampling Distribution. The concept of
Sampling Distributions is now being applied to answer questions like
"If a sample of a given size is selected from a certain population,
what are the chances that the sample mean will deviate by 'so many
points' from the population mean?" "What are 'probable' sample means
and what are 'improbable' ones?" etc.

If, in attempting to solve some of the problems on your own, you 
find that your answers consistently differ from the correct ones shown, 
the source of your difficulty is probably one of the following: 

1. You never read Unit IV and are attempting this section by
   ambition and continual prayer.

2. You studied Unit IV by rote, and have no understanding of what
   "normal curve area" problems involve.

   (For either of these, the only possible remedy is to review
   Unit IV now. Consume two sections with a glass of water before
   going to bed.)

3. You learned Unit IV well, but that was months ago, as you have
   taken a long break in getting here. Remedy: a quick review
   of Unit IV should get you healthy again.

4. You know the "area problems" cold, and think that the present
   stuff is very easy, but glancing at your work, you now notice
   that in trying to derive z scores for the sample means, you
   used $\sigma$ in the denominator instead of $\sigma/\sqrt{n}$. If such were
   the case, you unjustifiably attempted to mix apples and oranges
   in your equation, resulting in absolutely no chance of obtaining
   a correct answer. Remedy: You have been careless. If you
   have been drinking a beverage while doing your calculations,
   check its contents for any "strong" substances; replace drink
   with orange juice and get a night's sleep. Tomorrow start the
   morning by saying to yourself 100 times, "Sampling Distributions
   have $\sigma_{\bar{X}}$, not $\sigma$, as their standard deviation index."

Example Problem 

In the fictional town of Feckless, the citizens are famous for
flaunting face freckles. Each year, the Feckless Face Freckle Festival
is held, at which time a count is made of the face freckles flowering on
fellows from Feckless. Descriptive indices obtained for the population
at the very last freckle figuring were $\mu_X = 50$ and $\sigma = 10$. Suppose that
you are a visitor in Feckless who feels that the face freckle figures
forwarded are fraudulent. You think that a mean of 50 freckles a fellow
is farfetched and that the real figure is probably more like 37 (a
fairer freckle frequency in your mind). Having limited time in Feckless,
you are forced to limit your investigation to a random sample of only
four people; i.e., n = 4. Even though your challenge of the Feckless
face freckle figures will be futile ($\mu_X$ does equal 50), if you proceed
with a count for your randomly formed foursome, what are the chances that
the obtained $\bar{X}$ will support your initial hypothesis by being less than 37?
In other words, assuming normality, what is the probability of obtaining
a mean lower than 37 in a random sample of n = 4?

Step #1: You may recall from Unit IV that we called this a two-step
problem: (a) use of the "formula" to convert a score to an appropriate z
value, and then (b) use of the "normal curve table" to represent the z
as a probability index. To use the "formula" as our first step, however,
we must determine the parameters of the Sampling Distribution that define
our problem (i.e., the Expected Value and the Standard Error are needed).
As the next step, you should sketch the Sampling Distribution (the normal
curve), labeling the givens and shading the area of concern.



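$E(\bar{X}) = \mu_X = 50$        $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{10}{\sqrt{4}} = \frac{10}{2} = 5$

[Figure: normal curve of the Sampling Distribution; horizontal axis in sample means ($\bar{X}$'s) with 37 and 50 marked; Area A lies below 37 and Area B between 37 and 50]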

Step #2: Convert your sample mean to a z score and represent
the latter on your diagram.

For a sample mean of 37:  $z = \frac{37 - 50}{10/\sqrt{4}} = \frac{-13}{5} = -2.60$

Step #3: Use the "area under normal curve" table to locate the
area in question. Remember that the problem asked us to
determine the probability of selecting a sample mean below
37. Thus, we need to find the area below z = -2.60. The
table shows an area of 49.53% for that particular z. But,
remembering that the table works from the middle outward, the
49.53% must be taken to represent Area B on the graph: the
area between z = -2.60 and z = 0. We are interested in Area
A, the area below z = -2.60. This is obtained by remembering
that half of a normal curve constitutes 50% of the total scores.
Areas A and B together, therefore, equal 50%. To get Area A, we must use
subtraction: (A + B) - B = A; 50% - 49.53% = .47%.

Ans. .47% (less than 1%) represents the probability of
     selecting four people at random from the Feckless
     populace and discovering that they average less than
     37 in number of face freckles. A sample mean of 37,
     even with only n = 4, would be an unusual result,
     given that the population parameter is $\mu_X = 50$.
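
The "middle outward" bookkeeping of Step #3 can be mirrored in a few lines of Python. This is only a sketch: it assumes the standard library's `NormalDist` as a substitute for the printed table, and the variable names `area_a` and `area_b` are chosen here to match the areas described in Step #3.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 50, 10, 4          # Feckless population values and sample size
sample_mean = 37

standard_error = sigma / sqrt(n)              # 10 / 2 = 5
z = (sample_mean - mu) / standard_error       # -13 / 5 = -2.60

std_normal = NormalDist()                     # mean 0, standard deviation 1
# Area B: between z and the mean, which is what the printed table lists.
area_b = std_normal.cdf(0) - std_normal.cdf(z)
# Area A: the tail below z, found by subtracting from half the curve.
area_a = 0.5 - area_b

print(f"z = {z:.2f}, Area B = {area_b:.2%}, Area A = {area_a:.2%}")
```

Subtracting from 0.5 is not strictly needed on a computer, since `std_normal.cdf(z)` gives the tail directly, but it keeps the steps parallel to the table method.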






One More Example 



From a population of scores where $\mu_X = 100$ and $\sigma = 15$,
we select a random sample of size 9. What are the
chances that the sample mean, $\bar{X}$, will fall between
101 and 104?

Step #1: Determine the parameters of the Sampling
         Distribution. Draw the distribution, penciling
         in the area of interest.

         $E(\bar{X}) = \mu_X = 100$        $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{15}{\sqrt{9}} = 5$




Step #2: Convert your sample means into z scores.

sample mean of 101:  $z = \frac{101 - 100}{5} = 1/5 = +.20$

sample mean of 104:  $z = \frac{104 - 100}{5} = 4/5 = +.80$

Step #3: Use "area under normal curve" table to locate 
area of interest (Section B). 




Table shows that 7.93% of area falls between
z = 0 and z = .20. This is Section A.

Table shows that 28.81% of area falls
between z = 0 and z = .80. This is Section
A + B.

To find Section B alone:

(A + B) - A = B
28.81% - 7.93% = 20.88%

Our Answer: There is a 20.88% chance that the
sample mean will fall between 101 and 104.
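
An answer like this can also be checked by brute force: draw many random samples of 9 from a normal population and count how often the sample mean lands between 101 and 104. The sketch below is an illustration only. The number of trials and the random seed are arbitrary choices made here, and the simulated share should land near, not exactly on, the table answer.

```python
import random
from statistics import fmean

MU, SIGMA, N = 100, 15, 9      # population values and sample size from the example
TRIALS = 50_000                # number of simulated samples (an arbitrary choice)

rng = random.Random(7)         # fixed seed so the run is repeatable

hits = 0
for _ in range(TRIALS):
    # One random sample of N scores from the normal population.
    sample = [rng.gauss(MU, SIGMA) for _ in range(N)]
    if 101 < fmean(sample) < 104:
        hits += 1

estimate = hits / TRIALS
print(f"Share of sample means between 101 and 104: {estimate:.1%}")
```

Each pass through the loop is one trip to the population for a fresh sample, so the pile of simulated means is a rough stand-in for the Sampling Distribution itself.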


Final notes: 1. Always use a normal curve diagram.

             2. Keep in mind that these problems are
                very similar to the ones you attempted
                to solve in Unit IV. The only
                difference is that you are dealing
                with a normal distribution of sample
                means (the Sampling Distribution)
                rather than a normal distribution of
                individual scores.

7.5 POSTTEST 



1. A sample of 16 cases is selected from a population where
$\mu_X = 25$ and $\sigma = 4$. Determine the probability that the
sample mean will be:

a. greater than 26 

b. between 24 and 26

c. between 23.5 and 24 

2. From the same population as above, a sample of 4 cases
is selected. Determine the probability that the sample
mean will be:

a. between 26 and 28 

b. less than 23 

c. between 24 and 26 

Recommendations for Unit VII: 



1. Unit VII does not place too much emphasis upon computations.
But be sure that you understand the major
concepts presented.

2. Do not memorize definitions and concept descriptions. 
Try to define the concepts, like Sampling Distributions, 
in your own words. Your own definition is always more 
meaningful than a memorized one from the text. 




Unit VII Review Test
(answers in back)

The following description will be used to generate questions
that should cover all (or most) of the major concepts
discussed in this unit. Try to answer the questions without
looking back; if you must consult the text, that is a good
indication that the particular section examined needs some
review. Here is the description:

Through careful investigation, it is determined that for a
certain population of 1000 dogs, the number of bones buried
by individuals in 1977 averages 8.5, with a standard deviation
of 2.3. Bone burying scores are normally distributed for the
population.

1. Assuming that you didn't know the above parameters
and were interested in estimating the population mean:

a. why would a random sample be important?

b. how might you select your sample (there is no
one correct answer)?

c. what would be considerations involving sample
size (e.g., advantages/disadvantages of large
vs. small)?

2. Assume that sampling (in a hypothetical case) was
repeated over and over such that all unique samples
of size n were characterized in terms of the following
statistics. Which of these statistics would have
Expected Values equal to their corresponding population
parameters? In other words, which would give
unbiased estimates?

a. $\bar{X}$

b. Md

c. range

d. r (population $\rho$ = +.84)

e. r (population $\rho$ = 0.00)

f. variance (n-1 in denominator)

g. variance (N in denominator)

3. What is meant by Sampling Distribution? 






4. Describe how the following factors influence the nature
of a Sampling Distribution: (a) the statistic, (b)
sample size, and (c) the population.



5. If a Sampling Distribution of means was derived from
samples of n = 16 in the bone burying example (see initial
description), what would be known about its characteristics
in terms of: (a) form, (b) Expected Value, and (c) Standard
Error?



6. If the Sampling Distribution were based on samples of n =
25, which of your answers to the preceding question (#5)
would change?



7. In reference to the bone burying example, suppose a random
sample of 4 dogs was selected and characterized in terms
of the average number of bones buried by individuals in
1977. What would be the probability of obtaining:

a. a sample mean greater than 9.0?

b. a sample mean less than 7.0?

Assume the sample was based on n = 9. What would be the
chances of obtaining:

c. a sample mean between 8.3 and 8.7?

d. a sample mean between 8.5 and 9.5?

e. (SUPER BONUS) Above what score would 5% of
the possible sample means fall?






UNIT VIII



HYPOTHESIS TESTING

A. General Objectives 

In education and the behavioral sciences, the need to design and
conduct experiments in studying phenomena is obvious. The present
study of statistics has been to a large extent an attempt to equip
the student with an understanding of how statistics can be used to
describe a set of data. In this unit we will go one step further in
using statistics to test hypotheses (predictions or "educated guesses")
in a logical, objective manner. After completing this unit, the student
should be able to understand hypothesis testing whenever he/she
confronts it in the literature and be able to employ it in conducting
and analyzing simple experiments on his/her own.

B. Specific Objectives

8.1 Describe the rationale for hypothesis testing

8.2 Define null hypothesis and alternative hypothesis

8.3 1. Differentiate between Type I and Type II errors
    2. Select level of significance

8.4 Test hypotheses when population parameters are known (z test)

8.5 Test hypotheses when population parameters are unknown (t test)

8.1 Instructional Unit : Rationale and Basic Procedures 

The reasoning behind hypothesis testing follows directly from the
work we have done with Sampling Distributions. Remember, in the last
section of Unit VII we used the normal curve model and Sampling Distributions
to determine the probability of obtaining certain values of $\bar{X}$.
For example, if we know that the population of a certain school has a
mean IQ of 100 and a standard deviation of 15, we can use this information
to answer questions such as "If I select 10 students at random
from that population, what would be the probability that the mean of
the sample would be between 95 and 105?" If you do not understand how
Sampling Distributions can be used to answer that, you are in need of
a review; go back to the last section of Unit VII now.

We will try to add a little more to the above reasoning and see how
to test a hypothesis about the value of a parameter using sample statistics.
First, keep in mind that hypothesis testing is a decision-making
process. Researchers do not conduct experiments simply for the purpose
of "seeing what happens." Good researchers always have a hypothesis
(prediction or educated guess) of what the experiment will eventually
show. Examples of hypotheses are: "I hypothesize that students learn
more under programed instruction than under conventional instruction,"
"I hypothesize that the students at my school are brighter than the
average student in the city," "I hypothesize that Drug A will be more
effective than Drug B in reducing the incidence of heart attacks,"
and so on. Once a hypothesis is formulated, the researcher designs his
experiment for the specific purpose of testing that hypothesis.

Hypothesis testing is grounded on the notion of probability. In
education and the behavioral sciences, our experiments never conclusively
support one hypothesis over another. Given the results of one or more
experiments, we can never say that one hypothesis is definitely more
valid than another. We can say, however, that on the basis of our results,
one hypothesis is probably more valid than another. The more striking
the results, the more certainty we attach to our conclusion; the more
probable our conclusion becomes.

Suppose one hypothesizes that the students in Genius Junior High
School have higher IQs than the average student in the city. To test
this hypothesis, nothing fancy is really needed. We administer an IQ
test to a sample of Genius Jr. High students and compare the results to
the IQ scores for the city as a whole (which we get from the city school
administration). Suppose results show that the Genius Jr. High $\bar{X} = 127$
and the city mean ($\mu_X$) = 100. Can we now conclude with all certainty
that the Genius Jr. High students are brighter (in terms of
IQ) than the average student in the city? If you remember and understand
the material on Sampling Distributions, you would correctly conclude that
such a conclusion would be unjustified.

Sampling Distributions illustrate the concept of sampling fluctuation.
If we select all unique samples of the same size from the same population,
their means, $\bar{X}$'s, will vary: some will be equal to the population mean
($\mu_X$), some will deviate slightly, and some will be real weird (extremely
high or low). How do we know for sure that the sample we tested from
Genius Jr. High is a "good" sample? Sure, we used random selection procedures
in choosing the sample, but, as we know, the means of even randomly
selected samples vary from one sample to the other.

All that we know for sure is that the mean of our sample was equal to
127, and the $\mu_X$ (for the city) = 100. We could justifiably make the following
conclusion: Judging from our results, it appears probable (not
certain) that the average IQ of Genius Jr. High students is higher than
the average IQ of students in the city. We cannot be certain about this
because we have no way of knowing for sure that the sample we tested
(although randomly selected) was "perfectly" representative of the Genius
Jr. High student population. If we wanted to take the time and trouble
to test every Genius Jr. High student, then we could make a certain conclusion.
If $\mu_X$ for Genius Jr. High turned out to be 125, we could say
positively that this particular school population has a higher average
IQ than the city student population. Thus, in hypothesis testing we
generally use samples to make inferences about populations; because we
use samples (to save time and money), we lose the capability of being
certain in our judgments. We, therefore, become restricted to expressing
our conclusions in terms of probability statements.

8.1 POSTTEST (answers in back)

Briefly describe why probability rather than certainty is the rule in
hypothesis testing.



8.2 Instructional Unit : The null and alternative hypotheses 

Whenever you formulate statistical hypotheses, they should be exhaustive,
covering all possible outcomes of your experiment or study. For
this reason, we refer continually in statistics to what are called the
null hypothesis and the alternative hypothesis (hypotheses). The conventional
approach in experimentation is to always state a hypothesis of
"no differences," called the null. The null hypothesis, which we will
symbolize as "$H_0$," calls attention to the possibility (hypothesis) that
there are no real differences between the populations you are comparing
in your experiment (this idea will be better explained later).

Specifically, in statistical practice, the null hypothesis (i.e.,
the hypothesis of "no differences") is the one that is actually tested.
If there is sufficient evidence to reject it as being false, one can
conclude, within certain probability limits, that differences do exist.
If there is not sufficient evidence, the null must be retained. As
difficult as the logic may at first appear, the objective really is to
gather evidence that is convincing enough to "disprove" certain assumed
conditions (i.e., the null). If such evidence is present, the null is
rejected in favor of the decision that differences (probably) do exist.
In the absence of such evidence, we don't "prove" the null, but merely
retain it as a condition that still remains possible or acceptable.
That is why in reading research papers you will encounter reports of the
null being rejected or retained, but never of it being "proven." The null,
like the condition of innocence in law, is the "assumed" state of affairs
until there is sufficient cause to doubt its validity. In fact, the term
"null" derives from the conception in philosophy of science that the
orientation of empirical research is directed to nullifying hypotheses.
We will proceed in a lighter vein below (take a deep breath or two before
continuing).

Given the idea of $H_0$ (the null), the alternative hypothesis, symbolized
as $H_a$, covers all other possible outcomes of the experiment. Given
$H_0$ and $H_a$, you have exhausted the field in terms of the possible
findings of your experiment. Let's try an example.
Example:

We perform a study to determine whether the mean IQ of Genius
Jr. High students is higher than the mean IQ of all junior high students
in the city. We test this by selecting a random sample of 25 Genius Jr.
High students, and determining the mean IQ of the sample on the Stanford
Binet IQ test. We compare this score to the mean IQ of all junior high
students in the city (that is, the Jr. High population), data which we
collect from the School Administration records. Keep in mind that what
we are really asking is "Does the Genius Jr. High School population
(we'll symbolize its mean as $\mu_1$) differ in terms of IQ from the city
population (we'll symbolize this population mean as $\mu_2$)?" We can directly
determine $\mu_2$ from city records, but we must infer $\mu_1$ (Genius Jr. High
population mean) on the basis of our random sample.

What is the null hypothesis in this study?
What is the alternative hypothesis?

Ans.: First, state your null hypothesis, keeping in mind that the
null represents the prediction of "no differences":

$H_0$: $\mu_1$ (Genius Jr. High) = $\mu_2$ (City)

Note that the null, by stating $\mu_1 = \mu_2$, indicates the possibility
that there are really no differences between the average IQs of
the two populations.

Then state your alternative hypothesis (hypotheses), keeping
in mind that they must represent all remaining possible outcomes.
We can state one exhaustive alternative hypothesis
very simply as follows:

$H_a$: $\mu_1 \neq \mu_2$

Note that when used in conjunction with the null, the alternative
covers all possibilities: that is, you will either find
the two population means to be equal or you will find them to
be not equal.

You can get more specific, however, if you so desire:

$H_a$: $\mu_1 < \mu_2$ (which suggests that the city mean will be greater
than the Genius Jr. High mean)

But you cannot stop here: hypotheses must be exhaustive. You
need at least one more alternative:

$H_a'$: $\mu_1 > \mu_2$ (suggesting that the city mean will be less than
the Genius Jr. High mean)

This time our three hypotheses cover all possible outcomes: a) no
differences (null); b) city greater than Genius ($H_a$); c) city less than
Genius ($H_a'$).




Example:

A researcher believes that males who were born in the month of
February are taller than average. From the government census data he
finds the mean height of all males over 17 years of age in the U.S.
He also uses the census data to contact and measure a random sample of
100 males (over 17) who were born in February. He will use this sample
to generalize to (estimate) the population of all males born in February.

State the null and alternative hypotheses:

One possible answer:   $H_0$: $\mu_1$ (Feb. males) = $\mu_2$ (all males)
                       $H_a$: $\mu_1 \neq \mu_2$

Note that these two hypotheses are exhaustive in terms of outcomes.

Another possibility:   $H_0$: $\mu_1 = \mu_2$
                       $H_a$: $\mu_1 < \mu_2$
                       $H_a'$: $\mu_1 > \mu_2$

Note that these three are exhaustive and thus correct.

An incorrect answer:   $H_0$: $\mu_1 = \mu_2$
                       $H_a$: $\mu_1 < \mu_2$

Note that these do not cover all possibilities; what if $\mu_1$ turns
out to be greater than $\mu_2$?

An incorrect answer:   $H_0$: $\mu_1 = \mu_2$
                       $H_a$: $\mu_1 \neq \mu_2$
                       $H_a'$: $\mu_1 < \mu_2$

Although these are exhaustive, they are not mutually exclusive.
Suppose you find that $\mu_1$ is less than $\mu_2$. Do you accept $H_a$
(which is technically correct) or $H_a'$ (which is also correct)?
You cannot have two alternatives supported at the same time.
$H_a$ should be changed to $\mu_1 > \mu_2$, or $H_a'$ should be eliminated.

Final Notes:

1. Statistical hypotheses should always be stated in terms of a null
hypothesis (symbolized $H_0$) and one or more alternative hypotheses
(symbolized $H_a$, $H_a'$). The null hypothesis represents a prediction
of "no differences"; for example, $\mu_1 = \mu_2$. It should be noted,
however, that the null can be expressed in terms of actual parameter
values. For instance, a researcher may want to test the
hypothesis that the mean IQ of City X is greater than the mean IQ
of the U.S. population, which he knows to be 100. Using the procedure
we are accustomed to, he can specify his hypotheses as follows:

$H_0$: $\mu_1$ (city) = $\mu_2$ (U.S.)
$H_a$: $\mu_1$ (city) $\neq$ $\mu_2$ (U.S.)

But since he is testing the prediction that the city mean will
be greater than 100, he could specify his hypotheses in a slightly
different manner (one that really means the same thing, however):

$H_0$: $\mu_1 = 100$
$H_a$: $\mu_1 \neq 100$

Instead of using the parameter symbol for the U.S. population, $\mu_2$,
he used the parameter value, 100. The choice is pretty much arbitrary.

2. The specifications of hypotheses using the null and alternative(s)
should be exhaustive and nonoverlapping (i.e., no two alternatives
should cover the same possible finding).
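
The two requirements in Note 2 can be checked mechanically. The sketch below is a hypothetical helper written for this illustration, not a procedure from the text. It assumes every hypothesis compares $\mu_1$ with $\mu_2$ (or with a fixed value) and is described only by its relation: "=", "!=", "<", or ">".

```python
# Each hypothesis about mu1 versus mu2 is listed by the relation it asserts.
# The three possible findings are: mu1 < mu2, mu1 = mu2, mu1 > mu2.
OUTCOMES = {"<", "=", ">"}
COVERS = {
    "=": {"="},
    "!=": {"<", ">"},
    "<": {"<"},
    ">": {">"},
}


def check_hypotheses(relations):
    """Return (exhaustive, nonoverlapping) for a list such as ["=", "<", ">"]."""
    covered = [COVERS[r] for r in relations]
    all_covered = set().union(*covered)
    exhaustive = all_covered == OUTCOMES
    # Nonoverlapping means no finding is claimed by two hypotheses at once.
    nonoverlapping = sum(len(c) for c in covered) == len(all_covered)
    return exhaustive, nonoverlapping


print(check_hypotheses(["=", "!="]))        # null plus one two-sided alternative
print(check_hypotheses(["=", "<", ">"]))    # null plus two one-sided alternatives
print(check_hypotheses(["=", "<"]))         # leaves out mu1 > mu2
print(check_hypotheses(["=", "!=", "<"]))   # "<" is claimed by two alternatives
```

The first two calls describe the correct answers from the February example; the last two describe its two incorrect answers, one failing on exhaustiveness and the other on overlap.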

8.2 POSTTEST (answers in back)

1. A school guidance counselor is interested in determining whether the
seniors in his high school receive higher SAT scores than average.
He knows from Educational Testing that the mean SAT score of all
seniors in the United States is 500. He selects 30 seniors at random
from his school, and compares their average SAT performance to
the United States norm.

Specify the null hypothesis and the alternative(s). (Note that
there are several possibilities.)

2. For each set of hypotheses shown below, indicate whether the specification
is incorrect or correct. If incorrect, explain why.



Set 1                          Set 2                          Set 3

$H_0$: $\mu_1 \neq \mu_2$      $H_0$: $\mu_1 = 50$            $H_0$: $\mu_1 = \mu_2$
$H_a$: $\mu_1 = \mu_2$         $H_a$: $\mu_1 \neq 50$         $H_a$: $\mu_1 < \mu_2$
                                                              $H_a'$: $\mu_1 > \mu_2$

Set 4                          Set 5                          Set 6

$H_0$: $\mu_1 = \mu_2$         $H_0$: $\mu_1 = 23$            $H_0$: $\mu_1 = \mu_2$
$H_a$: $\mu_1 > \mu_2$         $H_a$: $\mu_1 = 22$            $H_a$: $\mu_1 = 100$
                               $H_a'$: $\mu_1 < 23$










8.3.1 Instructional Unit : Type I and Type II errors 

In review, the basic procedure used in hypothesis testing is to:

1. First assume that the null hypothesis is true; that the
population mean you are estimating ($\mu_1$) is actually equal to
the population mean to which it is to be compared ($\mu_2$).

2. Having made the above assumption (for example, $\mu_1 = 100$),
consider the Sampling Distribution for $\mu_1$, based on samples
of size n.

3. Given the above Sampling Distribution, determine whether
the sample mean, $\bar{X}$, is "probable" or "improbable" (we will
see later how to decide more objectively what is "probable").

4. If the sample mean, $\bar{X}$, turns out to be "probable," the
researcher retains the null hypothesis, and, in essence,
concludes that there is no difference between $\mu_1$ and $\mu_2$.

5. If the sample mean, $\bar{X}$, turns out to be "improbable," the
researcher rejects the null hypothesis and accepts one of
the alternatives. What he has done, in essence, is to
reason that given the Sampling Distribution that represents
the null condition ($\mu_1 = 100$), his sample mean, $\bar{X}$, turns
out to be improbable: one that would be found for very few of
the infinite samples of size n. Therefore, having nothing
else to go on excepting his sample mean, he concludes that
the above Sampling Distribution probably doesn't apply to the
population in question. The null hypothesis is rejected in
favor of one of the alternatives.



At this point it is not important for you to fully understand how
the researcher decides that his sample mean is "probable." As you will
find out later, this is pretty much an arbitrary determination which may
differ from researcher to researcher and from study to study. Nor is it
important for you to memorize the five steps listed for hypothesis testing.
It is essential, however, that you do acquire an understanding of
the logic, as reviewed below:

If the null is true, then my sample of n cases will relate
to a Sampling Distribution with $E(\bar{X}) = \mu_X$ and $\sigma_{\bar{X}} = \sigma/\sqrt{n}$.

Given that particular Sampling Distribution, does my sample mean,
$\bar{X}$, appear probable or improbable?

If $\bar{X}$ appears improbable, then there is a good chance that the parameter
values, $\mu_X$ and $\sigma$, used to construct the Sampling Distribution
do not describe the population from which the sample was drawn.
Therefore, it seems reasonable to reject the null hypothesis and
accept the alternative. The alternative says, in essence, that there
are differences between $\mu_1$ and $\mu_2$.
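
The logic just reviewed can be sketched as a small Python function. Several things here are assumptions made for illustration rather than rules from the text: the function name `evaluate_null`, the choice to count sample means that are equally far from the null value in either direction, and the cutoff of .05 for what counts as "improbable" (the text only says that this criterion is decided later). The numbers in the usage line are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def evaluate_null(sample_mean, null_mean, sigma, n, cutoff):
    """Assume the null is true, then judge how probable the sample mean is."""
    # Steps 1-2: the Sampling Distribution of means if the null is true.
    sampling_dist = NormalDist(null_mean, sigma / sqrt(n))
    # Step 3: probability of a sample mean at least this far from null_mean,
    # counting both directions (an assumption of this sketch).
    distance = abs(sample_mean - null_mean)
    probability = 2 * sampling_dist.cdf(null_mean - distance)
    # Steps 4-5: "improbable" is defined by the cutoff, an assumed value here.
    decision = "reject the null" if probability < cutoff else "retain the null"
    return probability, decision


# Hypothetical numbers: null mean 100, sigma 15, a sample of 36 averaging 103.
probability, decision = evaluate_null(103, 100, 15, 36, cutoff=0.05)
print(f"probability = {probability:.4f} -> {decision}")
```

Note that "retain" in the output means only that the sample mean is not improbable enough under the null; it is not a proof that the null is true.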




Trial of the Tenpins:
A Tale About Nullifying the Null 



At great risk of confusing the issue, but out of an unwillingness
to abandon our "law" analogy, a final attempt will now be made to illustrate
the logic of hypothesis testing in terms of deciding between
innocence and guilt in trial. Use your imagination and pretend that
deviancy in the students served by schools is considered a crime in the
fictional country, Untied Stakes. School XYZ is up for trial, accused
by a twelfth-grade double agent of serving a student populace that differs
in bowling average from the "accepted" national standard of 100 as
a school average (with $\sigma = 15$). Based on the convention of assuming
innocence until guilt is established, the jury must begin with accepting
the premise:

$\mu_1$ (School XYZ) = 100

The task of the prosecutor is to present evidence to convince the
jury that this condition is improbable, given that actual conditions cannot
be enacted (all 5,000 XYZ students cannot be tested). If he succeeds
in presenting this evidence, the jury will have no choice but to accept
the alternative, $\mu_1 \neq 100$, and chances are that the judge will pronounce
the XYZ student population as deviant! With the original records unavailable
and conditions making it impossible to test every student involved,
testimony (in this case, bowling scores) is obtained from a random sample
of 25 students. The sample averages 108 ($\bar{X} = 108$) on the lanes. The
prosecutor then takes this evidence and tries to argue that it makes innocence
($\mu_1 = 100$; the null) seem extremely improbable. The defense
attorney argues that this seemingly incriminating evidence is merely
circumstantial (obtained by unusually good rolls); specifically, he states
that it doesn't preclude the possibility that if all XYZ students were
tested, rather than just 25, $\mu_1$ would equal 100. Suppose that you were a
member of the jury; how would you vote?

Highlights of the Actual Trial Proceedings 

The prosecutor's closing speech goes as follows: "Ladies, gentlemen,
and others, the Defendant, School XYZ, is obviously guilty, based on
the evidence that has been rolled before you on these lanes. If we were
to assume that $\mu_1 = 100$ (with $\sigma = 15$) as my esteemed colleague (the Defense
Attorney) claims, then a Sampling Distribution based on all unique
combinations of 25 students selected from this population would be characterized
by $E(\bar{X}) = 100$ and $\sigma_{\bar{X}} = 15/\sqrt{25} = 3.0$. Using the normal curve
probability model which you have before you, this would indicate that
roughly 68% of the samples of n = 25 would average between 97 and 103
in their bowling scores (within ±1 S.E. of the mean); roughly 95%
would have averages between 94 and 106 (within ±2 S.E. of the mean).
Yet the random sample presented as evidence in this trial was shown to
average 108. Again, if you will refer to your normal curve table, you
will quickly see that after converting the submitted score of 108 to an
appropriate z score, (108 - 100)/3 = z, the resultant value of +2.66 is
so high that it would be obtained less than 1% of the time (50% - 49.61% =
.39%), a highly unlikely occurrence. Thus, I submit that for School XYZ,
$\mu_1$ does not equal 100, but represents a value somewhat higher, a value
that could more appropriately account for $\bar{X} = 108$. DEVIANCY of this
school is definitely indicated, and a vote of GUILTY is the only possible
decision."
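
The prosecutor's figures can be retraced with a short Python sketch, again assuming the standard library's `NormalDist` in place of the printed table. One small difference should be expected: the unrounded z is 8/3, or about 2.67, so the computed tail comes out slightly below the .39% read from the table at z = 2.66.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 100, 15, 25      # null value, population sigma, and sample size from the trial
sample_mean = 108

standard_error = sigma / sqrt(n)                  # 15 / 5 = 3.0
sampling_dist = NormalDist(mu, standard_error)

# Share of sample means within 1 and within 2 standard errors of the mean.
within_1 = sampling_dist.cdf(mu + standard_error) - sampling_dist.cdf(mu - standard_error)
within_2 = sampling_dist.cdf(mu + 2 * standard_error) - sampling_dist.cdf(mu - 2 * standard_error)

z = (sample_mean - mu) / standard_error           # 8 / 3
upper_tail = 1 - sampling_dist.cdf(sample_mean)   # chance of a sample mean of 108 or higher

print(f"97 to 103: {within_1:.1%}   94 to 106: {within_2:.1%}")
print(f"z = {z:.2f}, upper tail = {upper_tail:.2%}")
```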

The defense attorney's speech was rather long, as he tried to show
that the conditions of testing the sample were unusual relative to
"typical" bowling events (representativeness of conditions should always
be an important concern in using sample data to make inferences about
"true" states). He argued that the excitement of using official bowling
lanes added points to the students' scores (and hoped that this argument
would do the same to his case). His main arguments, however, can be
summarized as follows: "My colleague has taken you through the normal
curve probability table in an attempt to have you believe that $\mu_1 = 100$
does not apply to the Defendant. It has been stated that if such a
parameter did apply, an $\bar{X}$ as high as 108 would be obtained from a random
sample of n = 25 less than 1% of the time. If you, the members of the
jury, view your role as it is defined by law (to assume innocence unless
guilt is proven), it is that very figure that must lead you to cast your
vote for 'innocence.' Given the infinite range of normal curve values,
sample $\bar{X}$'s even higher than 108 are theoretically possible, given a
population $\mu_X = 100$. The obtained $\bar{X}$ of 108, although higher than most
of the $\bar{X}$'s we would obtain if other samples had been tested, is merely
a product of sampling fluctuation. School XYZ does average 100 overall,
and if the trial were repeated, we could just as easily obtain $\bar{X} = 92$
as we did the 108. Innocence cannot be ruled out based on the evidence
submitted. Therefore, the only possible decision is for a vote of NOT
GUILTY."

How would you vote now?

The Outcome

The most heated discussion among the jury concerned whether or not
the bowling scores used as evidence were obtained under representative
conditions. After failing to resolve this for 38 successive days, they
finally reached unanimous agreement that the data should be regarded as
valid. With this decided, the verdict submitted was GUILTY. The judge,
feeling that leniency was in order as this was a first offense,
sentenced the school to serve six months at a fast food restaurant, after
which it could return to providing education if students' bowling
averages were lowered to conform with the national average.

Epilogue 

The arguments presented by the opposing attorneys are much the same
as the reasoning applied by the statistician in deciding whether to
reject or retain the null hypothesis. In hypothesis testing, as in court
cases, decisions are not based on certainty but rather on probabilistic
judgments that weigh the amount of evidence brought into view. In the
example, the prosecutor showed that an X̄ as high as 108 will occur less
than 1% of the time given the assumptions of the null (i.e.,
assumptions of innocence). The jury, forwarding a verdict of
"guilty," presumably used some subjectively selected criteria to
decide that the 1% probability value was sufficiently small to
reject the validity of those assumptions. In statistics, as we
shall see shortly, conventional practice frowns on subjective
choices of such decision criteria by establishing ones expected
to be applied in all (or most) published work. But, even though
in our example trial, the "weight of evidence" seemed to go against
the school population averaging 100, can we be certain that the
jury made a correct decision? We cannot, as there is still some
probability that the 108 was simply an extreme value in the
Sampling Distribution that does define μ = 100, σ = 15, n = 25. Thus,
since statistics are only estimates of population parameters (and
testimony in a courtroom only remembrances of actual happenings),
the decision, whether to reject or not reject the null (guilty or
not guilty), will always be associated with some degree of doubt.



* * * * * * * * * * * * * * * *



If you understand the logic of hypothesis testing as described, 
feel free to merely skim the next few pages (up to Type I and Type 
II errors). If you are still fairly fuzzy and uncertain, do not give 
up hope - additional explanation will be provided below. 

Additional Explanation (Skip or skim this section if you understood
the preceding material.)

Let's continue to work with the example used earlier, but without
the analogy to law. A researcher knows that IQs are distributed
normally in the U.S. population with μ = 100 and σ = 15. He hypothesizes
that the students in School XYZ are brighter than average, which will be
evidenced by their higher IQ scores. He cannot possibly test every
single student in the school, so he decides to use a random sample of
25 cases. Keep in mind that he is really trying to compare one
population mean (the U.S. average, which we'll call μ₂) against another
population mean (School XYZ average, which we'll call μ₁). But he cannot
determine μ₁ directly, so he uses a sample to estimate it. He specifies
his hypotheses as follows:

H₀: μ₁ = μ₂ (or 100)

He goes out and tests his sample of 25 students. He computes their
mean IQ and finds that it is X̄ = 109. Now he must make a decision:
"Should I retain the null or reject it? Does X̄ = 109 suggest that
μ₁ (school pop.) = μ₂ (U.S. pop.), or that μ₁ ≠ μ₂?"






To answer the question, he reasons as follows: if the null
hypothesis is true, then the population of School XYZ has μ = 100
and σ = 15. If an infinite number of samples of size 25 were drawn
from this population and tested on IQ, the Sampling Distribution of
means would have E(X̄) = 100 and σ_X̄ = 3.
(By now you should know how we derived these values.)
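
In case the derivation of those two values has slipped away, here is a minimal sketch in Python. The function name standard_error is ours, chosen for illustration; the arithmetic is simply σ divided by the square root of the sample size.

```python
import math

def standard_error(sigma: float, n: int) -> float:
    """Standard error of the mean: sigma divided by the square root of n."""
    return sigma / math.sqrt(n)

# IQ example from the text: sigma = 15, samples of size 25.
# The Expected Value of the sample means is just the population mean, 100.
expected_value = 100
se = standard_error(15, 25)
print(expected_value, se)  # 100 3.0
```

The Expected Value needs no computation at all (it equals the population mean assumed under the null); only the spread of the Sampling Distribution shrinks as n grows.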

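The Sampling Distribution would look something like this:

[Figure: Sampling Distribution of means centered at E(X̄) = 100, with the area at and above X̄ = 109 (z = +3.00) shaded.]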



Since the researcher is totally familiar with the normal
curve and its assumptions, he regards the Sampling
Distribution as something that shows the probability of
obtaining size-25 sample means of different values selected
from a population where μ = 100 and σ = 15.

Do those parameters describe School XYZ?

We have no way of knowing for sure, but we can look at our
sample mean, X̄ = 109, and decide whether it is or is not a
"probable" occurrence, assuming that those parameters do
describe School XYZ. If it turns out to be a "probable"
occurrence, then there is no reason to believe that School
XYZ does not have a mean IQ of 100; thus, we will retain
the null. If it turns out to be an "improbable" occurrence,
then we have good reason for suspecting that School XYZ has
μ₁ ≠ 100; therefore, we reject the null and accept the
alternative.

Getting back to the researcher: he looks at the above
Sampling Distribution, and knowing about normal curves and
probability, he reasons that a sample mean equal to or greater
than 109 (z = +3.00) will occur only about .1% of the time
(50.00% - 49.87% = .13%, the latter figure corresponding
to the shaded portion of the curve).

Could his sample of 25 students with X̄ = 109 be one of that
.1%? Perhaps, but the researcher decides that probably that
is not the case. His sample mean of 109 appears very
"improbable," assuming the sample was drawn from a
population with μ = 100. Therefore, he concludes that his
sample was probably drawn from a population (specifically,
School XYZ) where μ₁ ≠ 100.

With that conclusion, he has rejected the null and accepted
the alternative.

Can he be entirely certain that his conclusion, μ₁ ≠ 100,
is a correct one? No, he can't be certain because some of
the sample means (.1% to be exact) selected from a
population where μ = 100 will be as high as 109. Maybe the
sample he selected was, by chance, exceptionally bright on IQ tests.
But given the data, it seems much more reasonable to assume
that School XYZ probably has a mean IQ higher than 100,
and that this particular sample with X̄ = 109 was merely
reflecting that parameter value.
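
If you would rather not trust the printed table, the .13% figure can be checked with a few lines of Python. This is only a sketch: the helper name upper_tail_area is ours, and it relies on the standard identity that the area above z in a normal curve equals one half of erfc(z/√2).

```python
import math

def upper_tail_area(z: float) -> float:
    """Proportion of a normal curve lying at or above a given z score."""
    return 0.5 * math.erfc(z / math.sqrt(2))

# Sample mean 109, Expected Value 100, standard error 3 (values from the text)
z = (109 - 100) / 3
p = upper_tail_area(z)
print(round(z, 2), round(p * 100, 2))  # 3.0 0.13
```

The second number printed is a percentage, so it corresponds to the 50.00% - 49.87% = .13% worked out from the table.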

Another Example (Study, Skim, or Skip, based on need) 

A researcher knows that height of males over 17 in Texas
is normally distributed with μ = 72" and σ = 10". He
wishes to determine whether males over 17 in his home town,
Austin, are taller than the state average. He can't
possibly test every male over 17 who lives in Austin, so
he decides to use a random sample of 100 cases. He
measures the 100 men in his sample and finds that
X̄ = 72.5".



He states his hypotheses as follows:

H₀: μ₁ (Austin) = μ₂ (Texas)



To decide on whether to reject or not reject the null, he
reasons as follows: If the null hypothesis is true, then
the population of males over 17 in Austin, Texas, has μ =
72". If an infinite number of samples of size 100 were
drawn from this population and measured on height, the
Sampling Distribution of means would have E(X̄) = 72 and
σ_X̄ = 10/√100 = 1.



This particular Sampling Distribution would look something
like this:

[Figure: Sampling Distribution of sample X̄'s if the null is true, with the horizontal axis marked in z scores.]

The researcher regards the above Sampling Distribution as
showing the probabilities of obtaining sample means of
different values when an infinite number of size-100
samples are selected from a population where μ = 72" and σ = 10".

Does that description apply to the population of all males
over 17 in Austin, Texas?

The researcher has no way of knowing for sure, but he can
look at his sample mean and determine whether or not it is a
"probable" occurrence, assuming that the μ for Austin,
Texas is 72". If it turns out to be improbable, then there
is good reason for rejecting the null; therefore, there
is good reason for suspecting that Austin is characterized
by μ₁ ≠ μ₂.

Okay, so the researcher examines his sample mean, X̄ =
72.5", in relation to the Sampling Distribution of all
means based on n = 100 and notes that the Standard Error
of the Sampling Distribution is σ_X̄ = 1. Therefore, for a
sample mean of 72.5", z = +0.50 [z = (72.5 - 72)/1]. Since
he is familiar with the normal curve and probabilities, he
concludes that a score within the range z = ±0.50 is really
a very common occurrence, not very improbable at all.

On the basis of this reasoning, he decides that the data
do not support rejecting the null. A sample mean, based
on 100 cases, which is equal to 72.5" would not be uncommon
or unusual if the sample were selected from a population
where μ₁ = 72".

His final conclusion is one of "no differences" between the
height of males in Austin, Texas, and the height of males
in all of Texas.
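
The whole chain of reasoning for the Austin example can be sketched in Python as follows. The function name z_for_sample_mean is ours, and the "share at least this far from the mean" figure is an extra illustration that the text does not compute; it uses NormalDist from Python's standard statistics module.

```python
import math
from statistics import NormalDist

def z_for_sample_mean(x_bar, mu, sigma, n):
    """z score of a sample mean within the Sampling Distribution of means."""
    return (x_bar - mu) / (sigma / math.sqrt(n))

# Austin example: X-bar = 72.5, mu = 72, sigma = 10, n = 100
z = z_for_sample_mean(72.5, 72, 10, 100)

# Share of sample means that land at least this far from 72 in either
# direction, assuming the null is true (illustration only).
standard_normal = NormalDist()
beyond = 1 - (standard_normal.cdf(abs(z)) - standard_normal.cdf(-abs(z)))
print(round(z, 2), round(beyond, 2))  # 0.5 0.62
```

Well over half of all size-100 samples would miss 72" by at least half an inch, which is why 72.5" is treated as a "probable" result.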

Note: At this point you should be more concerned with
getting a "flavor" of the logic used in hypothesis
testing than with trying to understand exactly how
"probable" and "improbable" are to be determined.

Type I and Type II Errors 

Throughout this unit, we have been trying to stress
the point that hypothesis testing is a game of probabilities.
If you could measure every single individual in both
populations you are comparing, no guesswork would be involved.
But since you must use sample data to estimate at least one
of your parameters, you can never be certain that the
conclusion you reached is absolutely correct. Suppose in the
above example the researcher measures his sample of 100 males
and finds X̄ = 76". He probably would reject the null by
reasoning that a sample mean of 76" is exceedingly unlikely
to be found if Austin, Texas males really averaged only 72"
in height. That may be true, but it is still possible that
the sample he happened to select was "one in a million,"
consisting of 100 very tall males. Thus, although it seems
improbable, it is still possible that he would be making an
error by rejecting the null. The error would be an
unavoidable one, but it is nevertheless important for you to
understand that whether you decide to reject or retain the null,
the decision is always subject to errors.

There are two types of errors to which our conclusions
are subject. Only one type can be made for a given
experiment. We will soon find out why.






The first error with which we should be concerned is
called a Type I error. We make a Type I error when we
falsely reject a true null hypothesis. This means that on
the basis of a sample statistic which appears "improbable,"
we conclude that the null hypothesis is false; i.e.,
μ₁ ≠ μ₂. In actuality, however, the null hypothesis is
true: the two populations being compared are really
identical; it's just that the sample we selected to estimate
the parameters for one population was, by chance, unusual,
giving us a false impression.

Example:

A researcher tries to determine whether the students
in School XYZ have math scores higher than the U.S.
population norm, which is 100. If he had time to actually
test the total School XYZ population, he would find (to
the school administrators' dismay) that the mean score
for the school is exactly equal to 100; XYZ has "typical"
students, μ₁ (School XYZ) = 100. But, as is normally
the case, it is simply not practical in terms of time and
money to test every student at XYZ. Therefore,
he uses a sample of 49 randomly selected students from all
grades. His results show that X̄ = 115, causing him to
consider it "improbable" that such a sample could have
been selected from a population where μ₁ = 100. He rejects
the null, and to the school administrators' collective
delight, accepts the alternative, μ₁ > 100.

The researcher has unknowingly made a Type I error.
Why? The sample he selected was, by chance, an exceedingly
unusual one. He did use random selection procedures, and
thus cannot be faulted. Chances are that the vast
majority of samples of 49 students selected from the XYZ
population would yield X̄'s very close to 100. But, for whatever
reason (actually the reason is sampling fluctuation), his
particular sample of 49 happened to be comprised of very
bright students from XYZ. The administrators will now
happily report to the community that XYZ is a "bright" school,
but if the study is done again, the results will probably show
the truth: XYZ is merely average.

Anytime you reject the null, there is some probability that 
you have made a Type I error. 

The second type of error, Type II, occurs when we retain
a false null hypothesis. This means that on the basis of a
sample statistic that appears "probable," we conclude that
there is no evidence to support the rejection of the null.
In actuality, however, the null hypothesis is false: the
populations we are comparing really do differ; it is just
that the sample we selected to estimate one population
parameter happened to be unusual, giving us a false
impression.

Example: A researcher tries to determine whether students
in School ABC have higher math scores than the U.S.
population norm, which is 100. If he had time to
test all 3,000 of ABC's students he would find
that this is really the case: ABC has a whopping
μ of 127! He doesn't have much time or money so
he tests a sample of 49 students, selected at
random from all grades. His results show that X̄ =
101, causing him to conclude that it is very
probable that his sample would have been selected
from a population where μ = 100. He thus has no
evidence with which to reject the null, and must
retain it. He sadly reports to the school
administrators that their students are merely "average."
But if the same study were to be repeated, the
results would probably show that School ABC is
really quite a bit above the national average in
student math achievements.

This researcher has unknowingly made a Type II
error. Why? The only evidence he had for accepting
or rejecting the null was his sample statistic,
X̄ = 101. By chance, the 49 students
comprising that sample, although randomly selected,
happened to be considerably lower than the actual
ABC average, μ = 127. The sample data gave him
a false impression.

Anytime you accept (retain) the null hypothesis, there is 
some probability that you have made a Type II error. 

The relation between possible outcomes and Type I and II
errors is shown in the table below:

                              Actual Parameter Condition
                              H₀ TRUE             H₀ FALSE

Outcome of    H₀ rejected     Type I error        CORRECT DECISION
Experiment
              H₀ retained     CORRECT DECISION    Type II error



Note on the table that you cannot make both types of errors
at the same time. If you decide to reject the null
hypothesis, you have made either a) a Type I error, provided
the null is true in actuality; or b) a correct decision,
provided the null is false in actuality.

If you decide to retain the null, you have made either a)
a correct decision, provided the null is true in actuality;
or b) a Type II error, provided the null is false in
actuality.

Since, in a given experiment, you can either reject or
retain (but not both), you run the risk of either a Type
I or Type II error (but not both).
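
The four possibilities just described can be written out as a tiny Python lookup. The function name classify_outcome and its two arguments are ours, invented for illustration. Keep in mind that in real research the first argument is never known; that is exactly why these errors go undetected.

```python
def classify_outcome(null_is_true: bool, null_rejected: bool) -> str:
    """Name the outcome for one combination of reality and decision."""
    if null_rejected:
        # Rejecting can only be a Type I error or a correct decision
        return "Type I error" if null_is_true else "correct decision"
    # Retaining can only be a correct decision or a Type II error
    return "correct decision" if null_is_true else "Type II error"

for truth in (True, False):
    for rejected in (True, False):
        print(truth, rejected, classify_outcome(truth, rejected))
```

Because each call takes a single decision (reject or retain), no single experiment can ever produce both kinds of error.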

Practice Questions: For each of the following, indicate
whether the researcher has made (a) a
Type I error; (b) a Type II error; or
(c) a correct decision
(use a cover sheet to block answers
from view).

1. On the basis of his sample data, a researcher decides
to reject the null by concluding that girls in the
ABC Day Care Center are better than average in
physical skills. In actuality, however, had he the
time to measure the whole ABC population, he would
have found this population to be merely average. Error?

ans. The researcher has made a Type I error by
rejecting a "true" null hypothesis.

2. The average SAT-Verbal score for all high school
students is 500. A researcher tests a random sample
of 50 students who live in Montana and finds X̄ =
498, a "probable" statistic. He decides to retain
the null. In actuality, the average SAT-Verbal
score for the Montana population is exactly 500.

ans. The researcher has made a correct decision by
retaining the null; there are really no
differences between μ₁ (Montana) and μ₂ (U.S.
population).

3. Suppose in the above example, the actual Montana
average is μ₁ = 450. What then?

ans. The researcher has made a Type II error by failing
to reject (retaining) a false null hypothesis. In
actuality, μ₁ ≠ μ₂; his sample data, although
appearing to be "probable," was not a true
indicator of the Montana average.




Keep in mind that Type I and II errors are totally
unintentional and, for the most part, undetectable in practice. All
you can do is decide whether to retain or reject the null on
the basis of your sample data. Since your sample is selected
at random, you are correct in assuming that it will provide a
reasonable estimate of the population parameters in question.
Due to the problem of sampling fluctuation, this will not
always be the case; sometimes your sample data will be
misleading. When this happens, you may end up rejecting a null
hypothesis that is actually true (Type I), or retaining a null
hypothesis that is actually false (Type II). Unless you
replicate the study, you will never be aware that an error was
made.

(8.3 Post test includes the following subsection)
8.3.2 Probability Level 

If you have an alert, inquiring mind you have probably
been somewhat frustrated by the discussion on hypothesis
testing procedures and Type I and II errors. The reason for your
frustration might revolve around the question of determining
when your sample statistic should be considered "probable"
(suggesting retention of the null) and when it should be
considered "improbable" (suggesting rejection of the null). You
may ask: "Aren't there any rules to the game?" "Are these
important decisions left totally up to the whims and moods of
the researcher?" "Isn't there a way to determine objectively
what is probable or improbable?"

The answer to all of the above questions is "clearly yes and
no."

You may recall from our fictional account of the "Trial
of the Tenpins," the jury voted "guilty" on the basis of the
prosecutor's argument:

IF μ for School XYZ = 100, an X̄ as large as 108
would be obtained in less than 1% of all
random samples of n = 25.

An X̄ = 108 was obtained for the sample selected.

1% is a "very low" probability.

Therefore, it is "probable" that μ for School
XYZ > 100.

The jury subjectively decided that the X̄ = 108, p (probability)
< 1%, was sufficient to reject the null condition of innocence.
If a much more conservative jury had been involved perhaps the
null would have been retained; stronger evidence (maybe an X̄ =
109) might have been needed to convince them. At the conclusion
of that section, it was implied that such subjectivity in
selecting decision criteria, although unavoidable where actual
probability estimates cannot be derived (as in real trials),
is reduced to some degree in research through the adoption of
conventional standards for retaining and rejecting the null.

However, even though there are rules to the game, much in
the way of making decisions is still left up to the individual
researcher. In the remainder of this section, we will discuss
some of these rules and procedures. Sometimes they may seem
difficult to understand, and therefore will require careful
reading. If you decide to merely skim over this section, you
will most likely be wasting your time. Give it your full
attention.

Prior to the actual running of an experiment, it is
expected that the researcher will specify what is referred to
as a level of probability. This specification indicates in
clear, precise, and objective terms how different the sample
statistic, X̄, must be in relation to expected (probable) values
in order to justify rejection of the null. The probability
level, as you might expect, is specified in probability terms,
rather than in score values. But through the use of Sampling
Distributions and the normal curve, the researcher will be
able to specify his acceptance or rejection levels in terms
of real score values. For example, "I will reject the null
hypothesis if my sample mean turns out to be greater than
110 or less than 90 (since both levels appear improbable,
assuming that the null were true)." Once these levels are
clearly specified, he goes out and tests his sample. The
important point is that the criteria for accepting or rejecting
the null are selected before the data is collected, rather
than on a post hoc basis.

How do you go about specifying a probability level?

The conventional choice of probability is usually the .01 or
.05 level, although exceptions are many times entirely
appropriate.

What does .01 or .05 mean?

Seemingly the best way to explain the use of probability level
is through an example. But for now, we can say that the .01
level means that the sample statistic you obtain is only 1%
probable, assuming the null is true; the .05 level means that
the sample mean you obtain is only 5% probable, assuming the
null is true. If you use a probability level of .01 and find
that your sample data is less than 1% probable (assuming the
null is true), you reject the null; if your sample data is more
than 1% probable, you retain the null. The same thing holds
for the 5% level, except that in this case you would be less
conservative in your criteria for rejection. All this is
probably fairly confusing, so let's apply it to an example
situation and see how it works.



Example Study

An investigator knows that IQ is normally distributed in the
U.S. population with μ = 100 and σ = 15. He wishes to
determine whether the adolescent population in the local juvenile
hall is average in IQ scores. He can't possibly test all 3,000
juvenile hall residents so he decides to employ a randomly
selected sample of 25. He states his hypotheses as follows:

H₀: μ₁ = 100

Hₐ: μ₁ ≠ 100

He decides to use the .05 level of probability (also called
level of significance, and symbolized as α, alpha).

α = .05

By specifying the significance (probability) level as α = .05,
he is saying, in essence, that if his sample mean (statistic)
turns out to be more than 5% probable (assuming the null is
true), he will retain the null; if it turns out to be less
than 5% probable (assuming the null is true), he will reject
the null. We will see how this works very soon.

First, he will reason that if the null hypothesis is true
(μ₁ = 100), the means of an infinite number of samples of 25
cases selected from the juvenile hall population will be
described by a Sampling Distribution where:

E(X̄) = 100

σ_X̄ = 15/√25 = 15/5 = 3

A sketch of this Sampling Distribution would look like
this:

[Figure: Sampling Distribution of means centered at E(X̄) = 100, with the horizontal axis marked in z units.]



Since the researcher knows about normal curves and
probability, he can examine the Sampling Distribution and reason
as follows: "If this Sampling Distribution is valid (μ₁ = 100),
then there is a 68% chance that I will obtain a sample mean
somewhere between 97 and 103 (between +1z and -1z), there is a
99.7% chance that my sample mean will fall somewhere between
91 and 109 (between +3z and -3z), and so on." If you're
uncertain about how these probabilities were derived, check
your "area under the normal curve" table. Won't 68% of the
cases always fall between +1z and -1z in a normal distribution?
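
For readers who like to see the table confirmed, here is a short Python sketch of those "chance of landing between" figures. The helper name area_within is ours; it uses the standard identity that the area between -k and +k standard errors of a normal curve equals erf(k/√2).

```python
import math

def area_within(k: float) -> float:
    """Proportion of a normal curve lying between -k and +k z units."""
    return math.erf(k / math.sqrt(2))

# Expected Value 100 and standard error 3, as in the example study
for k in (1, 2, 3):
    low, high = 100 - 3 * k, 100 + 3 * k
    print(f"between {low} and {high}: {area_within(k) * 100:.1f}%")
```

The first and third lines printed should agree, after rounding, with the 68% and 99.7% quoted by the researcher.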

But he has specified his significance level as α = .05.
What does this mean? In "hypothesis-testing" language it
means that the region of rejection for the null hypothesis
(the "improbable" region) will include values that are not
likely to occur more than 5% of the time. What is this region?
We can pencil it in on the sketch shown below:

[Figure: Sampling Distribution centered at 100 with both tails penciled in, beyond z = -1.96 and z = +1.96; the raw scores at those two points are marked "?".]



By examining the "area under the normal curve" table, we
determine that 2.5% of the time, values in a normal
distribution will fall above +1.96z, and 2.5% of the time, values will
fall below -1.96z. Thus, 5% of the time (2.5% + 2.5%), values
will occur that are either equal to or greater than +1.96z, or
equal to or less than -1.96z. Since in this case α = .05, the
researcher will consider this extreme 5% region of the curve
as improbable. It can be thought of as the "region of
rejection." If the sample mean falls within this region (one
of the penciled-in areas of the curve) it will be considered
"improbable," and justify rejection of the null hypothesis.
On the other hand, the area of the curve that falls between
-1.96 and +1.96 can be thought of as the "region of acceptance"
(it comprises 95% of the area). If the sample mean falls
within that region, it will be considered "probable," and lead
to retention of the null.






Do you understand how the shaded regions beyond the +1.96
and the -1.96 were selected? Remember that the identified
significance level, α = .05, defines an extreme sample
statistic (one that is "deviant" enough to support rejecting the
null) as that which has a probability of occurring 5% or less
of the time assuming that the null is true. But an "extreme"
value can be high or low, can't it? Thus, the area of
rejection, encompassing 5% of the total normal curve area, must
be equally divided between the high and low ends. The result
has to be 2.5% at the top and 2.5% at the bottom. How do we
derive z scores for these regions?

ans.: The same way we would if we were given the same problems
in Unit IV.

Which z score has 2.5% of the area in a normal curve above it?
The total area above z = 0 is 50%; thus, 50 - 2.5 = 47.5%
between "our" z and the mean. Look up 47.5 (actually .4750)
on the "normal curve" table and you will find that it most
closely corresponds to z = 1.96. The symmetry of the normal
curve (or the same mathematics if you've forgotten what
symmetry means) gives z = -1.96 to define the 2.5% region at the
bottom end. Now, how would the situation change had the
researcher decided to use α = .01? Here, he's being more
cautious, as he's saying that extremity will be defined as
statistics likely to occur 1% or less of the time given the null
assumptions. Since extremity can be either high or low, the
region of rejection must be balanced to leave half of 1% at
the top (.5%) and half at the bottom (.5%). What z scores
define these regions? The positive z must be one that has
.5% of the scores above it and, therefore, 50 - .5 = 49.5%
between it and z = 0. Look up .4950 on the table and you get
z = 2.58. At the bottom, the region would have to be defined
by z = -2.58. Compare these to the ±1.96 z's and it should
be obvious that the .01 level is asking for "more convincing"
evidence for rejection of the null than is the .05 level.
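
The table lookup just described (split α between the two tails, then find the z that leaves that much area above it) can be sketched with Python's standard statistics module. The function name two_tailed_critical_z is ours; NormalDist().inv_cdf plays the part of reading the table backwards.

```python
from statistics import NormalDist

def two_tailed_critical_z(alpha: float) -> float:
    """Positive z score that leaves alpha/2 of a normal curve above it."""
    return NormalDist().inv_cdf(1 - alpha / 2)

for alpha in (0.05, 0.01):
    print(alpha, round(two_tailed_critical_z(alpha), 2))
# 0.05 1.96
# 0.01 2.58
```

By symmetry, the cutoff at the bottom end is simply the negative of the value returned.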

Without reading further, can you determine what score
values would correspond to the ±1.96z regions? The answer
simply involves converting the z scores in question into raw
scores, by use of the basic z score formula.

X = σz + X̄

X₁ = 3(1.96) + 100 (this is the score corresponding to
z = +1.96; see sketch)

X₁ = 105.88

X₂ = 3(-1.96) + 100 (this is the score corresponding to
z = -1.96)

X₂ = 94.12
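
The conversion from z cutoffs to raw-score cutoffs can be sketched in Python as below. The function name rejection_cutoffs is ours, and the z value is deliberately rounded to two decimals (an assumption made so the results match a printed table rather than the slightly more exact values a computer would use).

```python
import math
from statistics import NormalDist

def rejection_cutoffs(mu, sigma, n, alpha):
    """Lower and upper sample-mean cutoffs for a two-tailed test."""
    se = sigma / math.sqrt(n)
    # Rounded to two decimals to mimic the "area under the normal curve" table
    z = round(NormalDist().inv_cdf(1 - alpha / 2), 2)
    return mu - z * se, mu + z * se

low, high = rejection_cutoffs(100, 15, 25, 0.05)
print(round(low, 2), round(high, 2))  # 94.12 105.88
```

Passing a different alpha to the same function gives the cutoffs for a more or less conservative test.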

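We can show these values on the Sampling Distribution as
follows:

[Figure: Sampling Distribution centered at 100, with the region of rejection below 94.12 (z = -1.96) and above 105.88 (z = +1.96).]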

Now the researcher can go out and test his sample of
25 juveniles. If the mean IQ of the sample, X̄, turns out
to be greater than 105.88 or less than 94.12, it will be
considered "improbable."

He tests his 25 juveniles, and finds X̄ = 90.45. What is
his decision?

He rejects the null. Using α = .05, his sample mean
appears "improbable." Such a value will be found less
than 5% of the time when an infinite number of size-25
samples are selected from a population where μ₁ = 100.
Therefore, there is very good reason to suspect that
μ₁ (Juvenile Hall) ≠ 100. The mean of this population is
most likely less than 100; otherwise, it seems extremely
unlikely that 90.45 would have been obtained.

But suppose, in actuality, μ₁ (Juvenile Hall) is equal to
100. What type of error would the researcher be making?

ans. A Type I error: rejecting a true null hypothesis.

Example Study II

Suppose the researcher decided to use α = .01 in the
exact same study.

Everything would be the same, but this time the
researcher is being more conservative in his criteria
for rejecting the null. By using α = .01, he is
saying that he will reject the null hypothesis if his
sample mean is determined to occur only 1% of the
time (or less) assuming the null is true. If he used
the .05 level, he would reject the null if the sample
mean was determined to occur 5% (or less) of the time.

Can you see how he is being more conservative this
time in his criteria for rejection?






If you can't see the difference perhaps it will become
more obvious as we continue.

To review from our earlier discussion,
since he is using α = .01, the region of rejection on
the Sampling Distribution will correspond to the area
that is only 1% probable in relation to the mean
(Expected Value = 100). More specifically, it will
correspond to two areas, one that accounts for .5% of
the area above the mean, and .5% of the area below the
mean (.5% + .5% = 1%).

Looking in the "area under the normal curve" table we
find that the area above z = 2.58 is approximately .5%
of the total (50.00% - 49.51% = .49%), and likewise, the
area below z = -2.58 is approximately .5% of the total.
Thus, we can illustrate our region of rejection as
follows:

[Figure: Sampling Distribution with REJECT regions shaded below z = -2.58 and above z = +2.58.]

A z = +2.58 corresponds to a raw score (actually a sample
mean value) of 107.74:

X₁ = 2.58(σ) + X̄
   = 2.58(3) + 100
   = 107.74

A z = -2.58 corresponds to a raw score of 92.26:

X₂ = -2.58(3) + 100
   = 92.26

Both of these values are shown on the diagram.

Now, suppose the researcher goes out and tests his sample
of 25 juveniles and finds that X̄ = 93. What is his decision?

He decides to retain the null; X̄ = 93 falls within the
region of acceptance. It is more than 1% probable (assuming
the null were true), and thus does not constitute
sufficient support for rejection.



Suppose that the mean IQ of the juvenile population is,
in actuality, less than 100. What type of error would
the researcher be committing?

ans. A Type II error: failure to reject a false null
hypothesis.

Suppose he had used the .05 level of significance. Would
his sample mean, X̄ = 93, be used as evidence for the same
decision, retention of the null?

ans. No. Had he used the .05 level, X̄ = 93 would fall
in the region of rejection (see the earlier diagram)
since it is less than z = -1.96 (X̄ = 94.12).
Thus, in this particular instance, using the .01
level would lead to a different conclusion than if
the .05 level were used.
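
To watch the same sample mean lead to opposite decisions, here is a hedged Python sketch. The function name decide is ours, and it compares z scores against the exact critical value from NormalDist rather than the rounded table value, which makes no difference for the numbers in this example.

```python
import math
from statistics import NormalDist

def decide(x_bar, mu, sigma, n, alpha):
    """Return 'reject' or 'retain' for a two-tailed test of the null."""
    z = (x_bar - mu) / (sigma / math.sqrt(n))
    critical = NormalDist().inv_cdf(1 - alpha / 2)
    return "reject" if abs(z) >= critical else "retain"

# Juvenile hall study: mu = 100, sigma = 15, n = 25, sample mean 93
for alpha in (0.05, 0.01):
    print(alpha, decide(93, 100, 15, 25, alpha))
# 0.05 reject
# 0.01 retain
```

The data never changed between the two lines of output; only the criterion chosen beforehand did.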

The .01 level is more conservative than the .05 level.
You are more likely to reject the null with the latter
level of significance. What if he made a less conventional
choice, such as α = .20?

With the .20 level he is being even less conservative.
Now he will reject the null if his sample mean is determined
to be 20% probable (or less). If you bother to compute what α = .20
means in terms of z score values and real score values (in
this example study) you would find that:



Region of rejection:  z ≥ +1.28;   z ≤ -1.28

                      X̄ ≥ 103.84;  X̄ ≤ 96.16

Thus, with α = .20, he would reject the null if his sample
mean was greater than or equal to 103.84 or less than or equal
to 96.16. Obviously, the criteria for rejection in this
case are much less stringent than if .05 or .01 were used.

Basic rule: The greater your level of significance (.001
            < .01 < .05 < .10, etc.), the less stringent
            (the less conservative) your criteria for
            rejecting the null hypothesis. Think about
            it!
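
The basic rule can be seen by computing the cutoffs for several levels at once. The sketch below assumes Python's standard `statistics.NormalDist` in place of the printed "area under the normal curve" table, so its values may differ from the tabled ones in the second decimal place (it works with 2.576 rather than the rounded 2.58, for example). The function name `two_tailed_cutoffs` is hypothetical.

```python
from statistics import NormalDist

def two_tailed_cutoffs(alpha, mu, std_error):
    """Critical z and raw-score cutoffs when alpha is split equally between two tails."""
    critical_z = NormalDist().inv_cdf(1 - alpha / 2)   # z with alpha/2 of the area above it
    lower = mu - critical_z * std_error
    upper = mu + critical_z * std_error
    return critical_z, lower, upper

# Same example study as in the text: hypothesized mean 100, standard error 3.
for alpha in (0.001, 0.01, 0.05, 0.10, 0.20):
    z, lower, upper = two_tailed_cutoffs(alpha, mu=100, std_error=3)
    print(f"alpha = {alpha:<5}  critical z = {z:.2f}  "
          f"reject if mean <= {lower:.2f} or >= {upper:.2f}")
```

As the level of significance grows, the critical z shrinks and the two cutoffs move in toward 100, so more sample means qualify for rejection.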

Relation between significance level (α) and Type I and
II errors

If you can apply the basic rule stated above to the previous
information concerning Type I and II errors, the relation
between α and these errors should become apparent.

The basic rule suggests that as α increases, your likelihood
of rejecting the null also increases.






Therefore, as α increases, the likelihood of committing
a Type I error increases. Why? If you are being less
stringent in your criteria for rejection, you will be more
likely to make a mistake by rejecting a null that is
actually true. Samples fluctuate, right? Frequently you
are going to collect sample data which deviate from the
population mean, either overestimating it or
underestimating it. If you use a large α (let's say .10 or .20)
you are going to regard a fairly large percentage (10%
or 20%) of possible sample means as "improbable" even
though the null hypothesis might be true. Thus, a large
α means more probability of rejection, and more
susceptibility to the error (Type I) of rejecting null
hypotheses when they are true.

Using the same reasoning, but in reverse, we can say that
as α increases, the likelihood of committing a Type II
error decreases. Why? If you are being less stringent
in your criteria for rejection (by increasing α), you are
going to reject more null hypotheses. Thus, there will be
less likelihood that you will fail to reject a null that
happens to be false (Type II error).

In summary, the relation between the choice of α and
susceptibility to Type I and II errors can be described as
one of "give and take." If you use a very conservative
(low or small) level of significance (such as .01), you run
relatively little risk of making a Type I error, but a
high risk of making a Type II error. On the other hand, if
you use a fairly liberal level of significance (such as
.10), you run a high risk of making a Type I error, but a
small risk of making a Type II error. Either way you go,
you win and you lose; the choice is up to you and the people
who sponsored the experiment.
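
The "give and take" can be put into numbers, but only by pretending to know something a researcher never knows: the true population mean. The sketch below makes that assumption openly. The true mean of 95 is invented purely for illustration (it does not come from the text), the function name `error_risks` is hypothetical, and `statistics.NormalDist` stands in for the normal curve table.

```python
from statistics import NormalDist

def error_risks(alpha, mu_null, assumed_true_mu, std_error):
    """Type I risk (when the null is true) and Type II risk (when the true
    mean is really assumed_true_mu) for a two-tailed test at level alpha."""
    critical_z = NormalDist().inv_cdf(1 - alpha / 2)
    lower = mu_null - critical_z * std_error
    upper = mu_null + critical_z * std_error
    # How sample means would really be distributed under the ASSUMED true mean.
    actual = NormalDist(assumed_true_mu, std_error)
    # Type II error: the sample mean lands in the region of acceptance
    # even though the null is false.
    type_ii = actual.cdf(upper) - actual.cdf(lower)
    return alpha, type_ii

# Null says 100; we pretend the truth is 95 (an illustrative assumption).
for alpha in (0.01, 0.05, 0.10, 0.20):
    type_i, type_ii = error_risks(alpha, mu_null=100, assumed_true_mu=95, std_error=3)
    print(f"alpha = {alpha:.2f}: Type I risk = {type_i:.2f}, Type II risk = {type_ii:.2f}")
```

Whatever true mean is assumed, the pattern is the same: raising α raises the Type I risk and lowers the Type II risk.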

Unit 8.3.2 Finale

The last point to be made in this particular instructional
unit concerns an interesting aspect of the relationship
between α and the chance of making a Type I error.

The interesting aspect is this: The chance of committing a
Type I error is equal to α.

What does this mean? In "question and answer" form it
means this:








Question: A researcher uses α = .05 in testing the null
          hypothesis. What are his chances of committing
          a Type I error?

Answer:   5% (If you use the .05 level of significance,
          then whenever the null is true, 5 times out of
          100 you will reject it.)

Question: What if he used α = .01?

Answer:   He would run a 1% chance of committing a Type
          I error (1/100 times he would end up rejecting
          a true null hypothesis).

Question: What if he used α = .20?

Answer:   He would run a 20% risk of making a Type I
          error (20/100 times he would end up rejecting
          a true null hypothesis).

That should be simple enough to memorize, but it will
require a little more mental effort to understand why this
relationship holds.

To use our continuing example:
Suppose our researcher is trying to determine whether the
average math scores of School XYZ students are equal to 100.
He knows that those scores are normally distributed in the
U.S. population with μ_X = 100 and σ = 15. He decides to test
a sample of 25 students from School XYZ since he simply does
not have the resources to measure the whole XYZ population.
(Being a conventional person, he specifies his level of
significance as α = .05.)

The hypothesis testing procedure dictates determining what
the Sampling Distribution of means (size 25) from the XYZ
population would look like if the null were true (that is,
μ_XYZ = 100). That is simple enough: E(X̄) = 100 and
σ_X̄ = 3. A sketch can be made of the Sampling Distribution
as follows:




Since α = .05, we know the region of rejection will correspond
to the areas beyond z = +1.96 and below z = -1.96 (this
will always be the case whenever the .05 level of significance
is used). He can "resketch" the Sampling Distribution to
show these areas of rejection.







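[Figure: Sampling Distribution of means centered at X̄ = 100 (z = 0), with the central region marked DO NOT REJECT and the regions marked REJECT beyond z = -1.96 (X̄ = 94.12) and z = +1.96 (X̄ = 105.88).]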





In order to see why the probability of making a Type I
error is equal to α, assume that in this particular instance
the null hypothesis is actually true; μ_XYZ does equal 100,
and if the researcher had the time to test every single
student, he would have found this to be the case. But,
since he is restricted to using a sample of 25, there is
no guarantee, due to sampling fluctuation, that his sample
data will be reflective of this true parameter value
(μ_XYZ = 100).

Now ask yourself the following question: Given the fact
that μ_XYZ = 100 (a fact unknown to the researcher),
and the fact that the researcher has chosen the .05 level
to test significance, what are the chances that his sample
data (X̄) will fall within the region of rejection, and thus
cause him to reject a true null hypothesis?

A long question, true enough, but the correct answer is
fairly brief: The researcher has a 5% chance of rejecting
the true null. Why? The explanation can be found within
the Sampling Distribution shown above. Doesn't that
Sampling Distribution represent the probabilities of
obtaining sample means of different values, based on
samples of 25 cases, which are selected from a population
where μ_XYZ = 100 and σ = 15? Another long question, but one
that merely states a definition you should have mastered
long ago in Unit VII. Given that definition, what are the
chances of obtaining a sample mean that will be greater than
105.88 (z = +1.96) or less than 94.12 (z = -1.96)? Again,
the answer is 5%; 5/100 samples of 25 cases will yield
means that fall into one of those regions.

Thus, whenever the null is true, the probability of
rejecting it and thus committing a Type I error will be
equal to the α level. If the researcher had used the .01
level, the chances of error would be 1%, and so on. Think
about this in reference to the Sampling Distribution. Try
to understand why this occurs before resorting to sheer
memorization.
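
One way to convince yourself is to play the researcher thousands of times over. The sketch below is a simulation, not a proof: it draws repeated random samples of 25 from a normal population in which the null really is true (mean 100, standard deviation 15) and counts how often the sample mean lands in the region of rejection. The function name, the number of trials, and the random seed are arbitrary choices made for illustration.

```python
import math
import random

def type_i_error_rate(critical_z, mu=100, sigma=15, n=25, trials=20000, seed=1):
    """Proportion of samples that reject a TRUE null (population mean really is mu)."""
    rng = random.Random(seed)              # fixed seed so the run is repeatable
    std_error = sigma / math.sqrt(n)
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        sample_mean = sum(sample) / n
        z = (sample_mean - mu) / std_error
        if abs(z) >= critical_z:           # sample mean fell in a region of rejection
            rejections += 1
    return rejections / trials

print("critical z = 1.96:", type_i_error_rate(1.96))   # should come out near .05
print("critical z = 2.58:", type_i_error_rate(2.58))   # should come out near .01
```

The proportions will not be exactly .05 and .01, since samples fluctuate, but they should hover close to the α level in each case.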






Final notes

1. A Type I error refers to the rejection of a true
   null hypothesis.

   A Type II error refers to the retention of a false
   null hypothesis.

2. Probability or significance level, symbolized as
   α (alpha), is used in research to specify how
   "improbable" a sample mean must be in relation to
   the parameter value in question (μ_X) in order to
   justify rejection of the null. Conventional
   selections for α are the .05 and .01 levels.

3. The greater the probability level, the greater the
   chances of rejecting the null, and the greater the
   chances of committing a Type I error.

   The smaller the probability level, the smaller the
   chances of rejecting the null, and the greater the
   chances of making a Type II error.

4. The chances of making a Type I error are actually
   directly defined by α. If α = .05, the chances
   for a Type I error are 5%; for α = .01, the
   chances for a Type I error are 1%, and so on.
   The chances of making a Type II error, although
   related to α, also depend on a number of other
   factors, and therefore cannot be determined as
   readily and directly.

8.3 POSTTEST (answers in back)

1. Define and differentiate between Type I and Type II
   errors.



2. Discuss the rationale for specifying a level of
   significance (α) prior to the conduct of your experiment
   or study.



3. Describe the relation between the choice of α and the
   chances of committing Type I and Type II errors.



4. A researcher uses α = .05 to test the hypothesis that
   children born in November are taller than average.

   a. What is the chance that he will commit a Type I error?

   b. What is the chance that he will commit a Type II
      error?

   (Note: If you feel that a question cannot be answered,
   use the notation "NA.")

   The researcher, after measuring a sample of 46 children
   born in November, decides to reject the null on the
   basis of the sample data.

   c. Which type of error might he be committing?

   If, before selecting his random sample, he decided to
   use α = .10 instead of α = .05, what would be the effect
   on his chances for:

   d. a Type I error?

   e. a Type II error?

8.4 Instructional Unit: Testing hypotheses when population
    parameters are known: the z test

You probably will be pleased to hear that once again,
you will be dealing with a "new" unit that really does
not involve anything "new," provided that Unit 7.1.2
has been understood (to some extent). In the next few
pages will be described, mostly through examples, the
exact procedures to be used for testing hypotheses
when population parameters (μ_X and σ) are known (as
has been the case in all examples given thus far in
this unit). For the most part, this will be review,
with only very slight changes from what was discussed
in Unit 8.3.

THE FOLLOWING TEST IS USED FOR COMPARING A SAMPLE
MEAN TO A PARAMETER VALUE.






Steps for hypothesis testing (using the z test)

1. State your hypotheses.

   For example: H0: μ_X = 79      Note: In this test we are
                Ha: μ_X ≠ 79      testing the hypothesis that
                                  μ_X = the parameter value, 79.

2. Select a level of significance.

   For example: α = .05

3. Given your level of significance, determine the region
   of rejection in terms of z score (you may actually
   sketch the appropriate Sampling Distribution, but, in
   actuality, such extra effort is not really needed).

   For example: For α = .05: region of rejection ≥ +1.96z
                and ≤ -1.96z.

                (By the way, this is always the case.
                Whenever the .05 level is used the
                "critical" z's will be +1.96 and -1.96.
                Why? Simply because the areas beyond
                these two scores will always account for
                5% of the area in a normal distribution.
                If you are disbelieving, check the "area
                under the normal curve" table.)

   For example: For α = .01: region of rejection
                ≥ +2.58z and ≤ -2.58z.

                (This too is always the case when the .01
                level is used. Why? Because the areas
                beyond these two z scores will always
                define 1% of the area in a normal
                distribution. Check the "area under the normal
                curve" table and discover this basic truth.)

4. Go out and collect your sample data.

5. Using the z test, determine whether your sample mean
   falls within the region of rejection or the region of
   acceptance.

   The test may sound new, but it should be familiar.
   All that it involves is finding the z value of your
   sample mean in relation to the parameters of the
   Sampling Distribution, E(X̄) and σ_X̄:

   $$z = \frac{\bar{X} - E(\bar{X})}{\sigma_{\bar{X}}}$$

   Then compare this z value to the critical z value you
   determined in step #3.

   If the z score for the sample falls within either
   region of rejection, you will reject the null. If
   it does not, you will retain the null.

Having completed these steps (which do not need to be
memorized, just understood), you have done all that is
required for this test. Let's try some examples:

Example #1 (this one will involve a test using pieces of
           rope, rather than people, as subjects in the
           study)

A company manufactures rope whose breaking strength
averages 300 lb (μ_X = 300) with a standard deviation
of 24 lb (σ = 24). It is believed that by a newly
developed process the mean breaking strength can be
increased. A researcher is hired to investigate this
hypothesis. Unfortunately, he does not have the time
to measure individually each of the 10,000,000 pieces
of rope manufactured by the new process, so he decides
to use a random sample of 64 pieces.

He conducts his hypothesis test as follows:

1. States the hypotheses

   H0: μ_X = 300      (Note: We are testing the hypothesis
   Ha: μ_X ≠ 300      that μ_X = the parameter value, 300.)

2. Selects level of significance

   α = .05

3. Determines the region of rejection

   For the .05 level: region is greater than
                      +1.96z and less than
                      -1.96z

   A sketch of this, although not needed in reality,
   would look like:

   [Figure: Sampling Distribution of means with regions of rejection beyond z = -1.96 and z = +1.96.]

   For the Sampling Distribution of means, based on 64 cases:

   $$E(\bar{X}) = 300$$

   $$\sigma_{\bar{X}} = \sigma/\sqrt{n} = 24/\sqrt{64} = 3$$

4. Collects the sample data

   The researcher tests the 64 pieces of rope
   selected for his sample and finds that their
   average breaking strength, X̄, = 306.

5. Uses the z test to determine whether the sample mean
   falls within the region of rejection

   $$z = \frac{306 - 300}{3} = 2.00$$

   Since the z score associated with X̄ = 306 is
   greater than the critical value, z = 1.96, the
   null hypothesis is rejected.

   The researcher concludes that the new method
   produces significantly stronger rope than the
   original method.



QUESTION: What type of error could have been committed?

ANSWER:   A Type I error: rejecting a true null hypothesis.

QUESTION: What is the probability that a Type I error was
          committed?

ANSWER:   5% (.05), since the chances of a Type I error,
          when the null is true, equal α.

QUESTION: What would the procedures have been if the .01
          level had been used?

ANSWER:   The procedures would have been identical, with
          the only change being the region of rejection.
          When the .01 level is used, the region of
          rejection involves the areas beyond +2.58z and -2.58z.

QUESTION: The decision, using α = .01?

ANSWER:   Retain the null: Our computed z value, 2.00,
          does not fall within the region of rejection for
          the .01 level.

QUESTION: What type of error is possible?

ANSWER:   Type II in this case: failure to reject a false
          null.

QUESTION: The chances of that error?

ANSWER:   Impossible to compute without much more
          information; only Type I error probabilities can
          be determined directly.

QUESTION: O.K., then, what are the chances of a Type I
          error in this case?

ANSWER:   There is no chance; you cannot make a Type I
          error when you retain the null.

You should be able to see that it is possible to reach
opposite conclusions from the same exact experiment,
depending upon which level of significance is used.
That is why the researcher should select α before
collecting his data (i.e., doing the actual study or
experiment).
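
The five steps and the rope example can be strung together in a short Python sketch. Treat it as an illustration rather than the official procedure: the names `z_test` and `CRITICAL_Z` are invented here, and the only critical values included are the two quoted in the text (1.96 for the .05 level and 2.58 for the .01 level).

```python
import math

# Critical z values for a two-tailed test, as given in the text.
CRITICAL_Z = {0.05: 1.96, 0.01: 2.58}

def z_test(sample_mean, n, mu, sigma, alpha):
    """Compare a sample mean to a parameter value when sigma is known."""
    std_error = sigma / math.sqrt(n)             # standard error of the mean
    z = (sample_mean - mu) / std_error           # step 5: z value of the sample mean
    critical = CRITICAL_Z[alpha]                 # step 3: region of rejection
    decision = "reject" if abs(z) >= critical else "retain"
    return z, decision

# Rope example: mu = 300, sigma = 24, n = 64, sample mean = 306.
for alpha in (0.05, 0.01):
    z, decision = z_test(306, n=64, mu=300, sigma=24, alpha=alpha)
    print(f"alpha = {alpha}: z = {z:.2f} -> {decision} the null")
# alpha = 0.05: z = 2.00 -> reject the null
# alpha = 0.01: z = 2.00 -> retain the null
```

The computed z is the same in both runs; only the region of rejection changes, and with it the decision.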

Last Example (This one will be done much more quickly)

A hospital administrator wants to determine whether the
yearly incomes of orderlies in his state compare favorably
with the national average, μ_X = $12,000 with σ = $1,000.
He decides to use a random sample of 25 orderlies selected
from different hospitals in his state. He also decides to
use α = .01 for his significance test.

His hypotheses: H0: μ_X = 12,000
                Ha: μ_X ≠ 12,000

The Sampling Distribution of means, based on 25 cases, will
have:

$$E(\bar{X}) = 12{,}000$$

$$\sigma_{\bar{X}} = 1{,}000/\sqrt{25} = 200$$

Use of the .01 level places the region of rejection beyond
±2.58z. That is:

[Figure: Sampling Distribution of means with regions of rejection beyond z = -2.58 and z = +2.58.]

He goes out and tests his sample. He finds that X̄ = $11,200
for this particular group.

By use of the z test, we determine how this value relates
to the parameter (μ_X = 12,000) of the Sampling Distribution of
means.

$$z = \frac{11{,}200 - 12{,}000}{200} = \frac{-800}{200} = -4.00$$

With this calculation, it is clear that X̄ is well within
the region of rejection. The administrator is forced to
conclude that hospital orderlies in his state are
underpaid relative to the national average.

What type of error is possible here?

A Type I: rejection of a true null hypothesis. There is
          always the possibility that the average in his
          state is $12,000, but the sample he selected was,
          by chance, exceptionally low in income. There
          is always a chance for error whenever sample
          data, rather than population data, are used.
          The .01 level of significance sets the chance
          of a Type I error at 1%.

8.4 POSTTEST (answers in back)

1. The Scholastic Aptitude Test (SAT) yields a
   national mean of 500 and a standard deviation of
   100 for high school seniors. The guidance
   counselor at Happy Valley High School selects a
   random sample of 100 seniors to determine whether
   the school SAT mean is comparable to the national
   norm. He finds that the mean SAT score of his
   sample is 482.

   a. State the null hypothesis.

   b. What is σ_X̄?

   c. What is the z equivalent of the Happy Valley
      mean of 482?

   d. If α = .05, what should the conclusion of this
      study be?

   e. In drawing the above conclusion, what type of
      error is possible?

   f. Suppose the mean score of the 100 students
      was 520. Using the .05 level, what would you
      conclude?

   g. In drawing the conclusion in f., what type of
      error do you risk?

2. A manufacturer of flashlight batteries reports
   that his batteries have a mean life of 25 days
   with a standard deviation of 5 days. A consumer
   testing organization purchases a random sample
   of 100 batteries and has them used. The mean life
   of the sample batteries proves to be 23.5 days
   (X̄ = 23.5).

   a. State the null hypothesis.

   b. What is σ_X̄?

   c. What is the z equivalent of X̄?

   d. What is the decision if α = .05?

   e. In making the decision in d., what type of
      error is possible?

   f. What is the decision if the significance level
      is .01?

   g. Suppose that a sample of 25 batteries was used.
      If α = .05, what is the decision?

   h. In making the decision in g., what type of
      error do you risk?

   i. BONUS: How is the probability of a Type I
      error influenced by reducing the sample
      size from 100 to 25? What about a
      Type II error?



8.5 Instructional Unit: Testing hypotheses when population
    parameters are unknown: the t test

In this unit, we will cover briefly another procedure
for hypothesis testing. If you have a superhuman
memory, you might recall that in every example problem
covered in the last few units, population parameters
(μ_X and σ) were provided and known to the researcher.
For example, if the researcher wanted to compare the
IQs of adolescents in Juvenile Hall to the U.S. average,
he knew that IQ in the U.S. had μ_X = 100 and σ = 15;
if he wanted to test the strength of a new type of rope
against the standard type of rope, he knew that the
standard population had μ_X = 300 lb. and σ = 24 lb.; if he
wanted to determine whether hospital orderlies in his
state earned more money than average, he knew that for
the whole U.S. population of orderlies, μ_X = $12,000
and σ = $1,000. Check these out and you will find
that in every single example, population parameters
were known. When they are known, the z test described
in Unit 8.4 is perfectly appropriate for use in
hypothesis testing.

Unfortunately, in most instances, population
parameters will not be known. A teacher, for example,
may wish to test the hypothesis that the average IQ in
her school is 100, but be unaware of the population
mean and standard deviation. Can she go ahead and test
her sample by use of the z test? The rules dictate
that she may not, and if you check the formula you
will realize that its application in this case would
be impossible. In order to use z, you must know the
σ_X̄ for the Sampling Distribution; in order to know
the value of σ_X̄, you must be able to divide σ (the
population standard deviation) by the square root of
n. Does the teacher know the value of σ? According
to the above description, she does not; as far as
the z formula is concerned, she is finished before
she starts.

Basic Rule: When population parameters are unknown,
the z test is inappropriate. Test your hypotheses
by use of the t test.

By now you are probably mumbling nasty words to
yourself given the thought of being required to learn a
whole new procedure. Chin up: the t test is almost
exactly the same as the z test with two very slight
changes:

(1) You will have to estimate σ since it is not
    provided. This is done very simply by
    computing σ̂ for your sample (see Unit III),
    and using it as the population standard
    deviation. Actually, given nothing else,
    it is the best estimate available.

    "Will I have to go through all the work to
    compute σ̂?" you may ask with fear and
    trepidation.

    No. In this unit, σ̂ will be provided. In
    real life it will not; then you will have
    to compute σ̂ using the procedures covered
    in Unit III.

    The formula for t, as compared to z, is as
    follows:

    $$z = \frac{\bar{X} - E(\bar{X})}{\sigma_{\bar{X}}} \quad \text{where } \sigma_{\bar{X}} = \sigma/\sqrt{n}$$

    $$t = \frac{\bar{X} - E(\bar{X})}{\hat{\sigma}_{\bar{X}}} \quad \text{where } \hat{\sigma}_{\bar{X}} = \hat{\sigma}/\sqrt{n}$$



(2) The second minor change is that you will have to use a
    t table, instead of the "area under the normal curve"
    table, to find the critical values (i.e., the cut-off
    points indicating whether the null should be retained
    or rejected).

Find the t table, Table II, in the Appendix. Note that the
table is partially arranged in terms of columns corresponding
to different probability values (levels of significance).
If α = .01, use the .01 column; if α = .05, use the .05
column.

The rows on the table correspond to what are called
degrees of freedom (df), the rationale for which will
not be given here. No need to get excited: df = n - 1.

Thus, if there are 25 subjects in your sample, you find
the row corresponding to df = 24; with 20 subjects, look
for df = 19; with 60 subjects, look for df = 59, and so on.
If a df is not tabled, use the next available lower
value.

Some Examples: Find the critical t value for a study in which
n = 4 and α = .01.

Ans: t = 5.84 (df = 3)

In the same study, α = .05.
Ans: t = 3.18

What about when n = 24, and significance level = .01?
Ans: t = 2.81 (df = 23)

If you cannot find the above values on the t table, seek help
immediately. Keep in mind that these values will be used to determine
the region of rejection, just like in the case of z values. If the t
value you compute exceeds the critical t value, you will reject the
null, and so on.
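
The table-lookup routine (find df = n - 1, pick the column for your α, and drop to the next lower tabled df if yours is missing) can be sketched in Python. The handful of rows below are standard published two-tailed critical values carried to three decimals; they are an excerpt chosen for illustration, not a copy of Table II, and the names `T_TABLE` and `critical_t` are hypothetical.

```python
import bisect

# A few rows of a two-tailed t table: df -> {alpha: critical t}.
T_TABLE = {
    3:  {0.05: 3.182, 0.01: 5.841},
    15: {0.05: 2.131, 0.01: 2.947},
    23: {0.05: 2.069, 0.01: 2.807},
    24: {0.05: 2.064, 0.01: 2.797},
    30: {0.05: 2.042, 0.01: 2.750},
    40: {0.05: 2.021, 0.01: 2.704},
    60: {0.05: 2.000, 0.01: 2.660},
}

def critical_t(n, alpha):
    """Return (df row used, critical t) for a sample of size n."""
    df = n - 1
    tabled = sorted(T_TABLE)
    # Index of the largest tabled df that does not exceed ours.
    i = bisect.bisect_right(tabled, df) - 1
    if i < 0:
        raise ValueError("df is smaller than any row in this excerpt")
    row = tabled[i]
    return row, T_TABLE[row][alpha]

print(critical_t(4, 0.01))    # (3, 5.841)
print(critical_t(4, 0.05))    # (3, 3.182)
print(critical_t(24, 0.01))   # (23, 2.807)
print(critical_t(50, 0.05))   # df = 49 is not in the excerpt, so row 40 is used: (40, 2.021)
```

Dropping to the lower df gives a slightly larger critical value, which errs on the conservative side.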

O.K., now gather your senses and remember:

1) If population parameters are not given, use the t formula
   rather than the z formula for comparing a sample mean to a
   parameter value.

2) Using t is almost exactly the same as using z except:

   a) the sample standard deviation, σ̂, is
      used in the denominator of the equation
      (see formula), and a t value is computed
      as in the z formula.

   b) Instead of using the "area under the
      normal curve" table, you use the t
      table to find the critical value,
      against which the computed t score
      (step a) is compared.

Example Problem: A teacher wishes to determine whether
                 the average IQ in her class of 300
                 students is above 100. She doesn't
                 have the time to test all 300, so
                 she decides to use a sample of 25
                 (n = 25). A researcher recommends
                 that she use the .01 level of
                 significance in her hypothesis test.

Question: Which test should she use?

ans. The t test, because the population mean and standard
     deviation are not available.

Question: What are the chances she will make a Type
          I error?

ans. 1%, because she is using the .01 level; the t
     test does not change this basic rule.

She states her hypotheses as follows:

H0: μ_X = 100
Ha: μ_X ≠ 100
(this is always the same whether you use t or z)

She determines the region of rejection using the t table:

t = 2.80 (when n = 25, df = 24)

Thus, if her computed t value is greater than or equal to
t = +2.80, or is less than or equal to t = -2.80, she will
reject the null. The areas equal to and beyond +2.80t
and -2.80t constitute the region of rejection.

She now goes out and tests her sample of 25, keeping
in mind that since σ is unavailable, she will need to
compute σ̂ (the standard deviation) of the sample data.







For the sample tested, X̄ = 104 and σ̂ = 10.

Using the t formula:

$$t = \frac{\bar{X} - E(\bar{X})}{\hat{\sigma}_{\bar{X}}} \quad \text{where } \hat{\sigma}_{\bar{X}} = \hat{\sigma}/\sqrt{n} = 10/\sqrt{25} = 2$$

$$t = \frac{104 - 100}{2} = 4/2 = 2.00$$

Does she retain or reject?

She must retain since the computed t value, +2.00, does
not exceed the critical t value of +2.80. Her sample
data fall into the region of acceptance.

What type of error is possible?

ans. Type II

Last Example

A lightbulb manufacturer reads in Consumer Reports
that a good bulb should last for 52 hours. He decides
to test his bulbs against this recommended standard
using a sample of 16 bulbs. Being conservative, he
decides to use the .01 level of significance for his
hypothesis test.

He states his hypotheses as follows:

H0: μ_X = 52
Ha: μ_X ≠ 52

The t test must be used since the population standard
deviation, σ, is unavailable.

The critical value for t when df = 15 (n - 1) and α = .01
is 2.95. Thus the area of rejection is that equal to
or beyond +2.95t and -2.95t.

He tests his sample of 16 bulbs and finds X̄ = 42 and
σ̂ = 12.

What is his decision?

The decision must rest upon the computed t value:

$$t = \frac{42 - 52}{12/\sqrt{16}} = \frac{-10}{12/4} = \frac{-10}{3} = -3.33$$

The computed t value falls beyond the critical value, t =
-2.95 (or +2.95).

The decision must be to reject the null; the lightbulbs
manufactured by this company do not meet the standard of
52 hours of life.

What type of error is possible?

ans. Type I; anytime you reject the null there is some
     chance of committing a Type I error: rejecting
     a "true" null hypothesis.

Notes:

1. Use the t test when population parameters are
   unknown.

2. The t test is almost identical to the z test
   except:

   a) You will need to compute σ̂ (standard deviation)
      for the sample data. (On example problems,
      however, σ̂ will be given.)

   b) Use σ̂ as an estimate of σ in the denominator
      of the t equation.

   c) Compute your t value the same way as for z.
      The critical value against which your
      computed value must be compared is to be found
      in the t table, by finding the column
      corresponding to your level of significance, and the
      row corresponding to your df (df = n - 1).
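
To see how little changes from z to t, here is a minimal Python sketch of the t procedure applied to the two examples in this unit. The function name `t_test` is invented for illustration, and the critical values (2.80 and 2.95) are passed in by hand exactly as they were read from the t table in the text, so nothing here replaces the table lookup.

```python
import math

def t_test(sample_mean, sample_sd, n, mu, critical_t):
    """Compare a sample mean to a parameter value when sigma must be estimated."""
    est_std_error = sample_sd / math.sqrt(n)     # estimated standard error of the mean
    t = (sample_mean - mu) / est_std_error
    decision = "reject" if abs(t) >= critical_t else "retain"
    return t, decision

# Teacher's IQ example: sample mean 104, sample sd 10, n = 25, critical t = 2.80.
t, decision = t_test(104, sample_sd=10, n=25, mu=100, critical_t=2.80)
print(f"IQ example:        t = {t:.2f} -> {decision} the null")
# IQ example:        t = 2.00 -> retain the null

# Lightbulb example: sample mean 42, sample sd 12, n = 16, critical t = 2.95.
t, decision = t_test(42, sample_sd=12, n=16, mu=52, critical_t=2.95)
print(f"Lightbulb example: t = {t:.2f} -> {decision} the null")
# Lightbulb example: t = -3.33 -> reject the null
```

Apart from using the sample standard deviation and a critical value from the t table, the steps are the ones already used for z.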

8.5 POSTTEST (answers in back)

1. Find the critical t values when:

   a. n = 23 and α = .01     b. n = 23 and α = .05
   c. n = 4 and α = .02      d. n = 61 and α = .001
   e. n = 11 and α = .05

2. Looking at the t table, what is the relationship
   between the critical t and:

   a. n?

   b. α?

3. A school teacher reads in the newspaper that the
   average second grader should be able to run the
   40 yard dash in 15 seconds. She decides to test
   her own second graders by timing a sample of 9
   children.

   a. Which hypothesis-testing procedure should
      be used, z or t? Why?

   The teacher is told by the school psychologist
   to use α = .05.

   b. What are her chances of making a Type I
      error? a Type II error?

   c. What critical value defines the region of
      rejection?

   She goes out and tests her sample, finding X̄ =
   12.5 and σ̂ = 3.

   d. What is her decision regarding the null?

   e. What type of error is she in danger of
      committing?

   Suppose she had used the .01 level instead of
   the .05 level of significance.

   f. What would her decision be?

   g. What type of error would she be in danger of
      committing?






Unit VIII Review Test
(answers in back)

1. A researcher does a study to test the null hypothesis, μ_X = 197.
   He obtains X̄ = 230, which is sufficient to reject the null. Can
   he conclude that the population mean in question is definitely
   higher than 197? Explain.



2. What type of error (I or II) might have been made in the above
   example?

3. Based on the decision in #1, which one(s) of the following
   alternative hypotheses would be supported?

   a. Ha = 197     c. Ha ≠ 197

   b. Ha = 230     d. Ha > 197

4. Which of the following α would best protect against a Type II error?



.01 b. .10 ^. .(^ d. .001 



5. Which of the above alternatives would best protect against a
   Type I error?

6. Describe the logic under which hypothesis testing works (Hint:
   analogy to law).



7. A researcher fails to reject the null using α = .05. He reasons,
   "Had I selected .01 as the level I would have rejected it. Too bad!"
   Does his reasoning make sense?

8. The average daily temperature in July in Newtown (at 12:00 noon) is
   80 degrees, with σ = 8, based on town records. Farmer Jones thinks
   it's getting warmer over the years, so he measures temperature on
   100 randomly selected July days in a 4-year period. He obtains X̄ = 86.

   a. What is Farmer Jones' null hypothesis?






   b. If he decides to test it at α = .05, what is the rejection
      region (i.e., critical values)?



   c. What is the computed z value?

   d. What is the decision?



   e. What type of error is possible?



9. Suppose you wish to test μ_X = 62 at α = .01, using a sample of
   n = 25. Sample statistics are X̄ = 68 and σ̂ = 14.

   a. Why can't the z test be used?



   b. What is the computed t value?

   c. What is the tabled t value?



   d. What is the decision regarding the null?






UNIT IX

SELECTED INFERENTIAL TECHNIQUES

A. General Objectives

This unit extends the previous one by presenting other (more commonly
used) techniques for making statistical inferences. The first two to be
presented allow us to test hypotheses regarding the difference between
two population means, asking, for example, "Does one population of
individuals have higher scores than another (perhaps as a result of a special
treatment that one received)?" The third technique extends our hypothesis
testing procedure to the correlation coefficient through a popular test
for judging its "significance" when computed from samples. The unit
closes with a discussion of a somewhat different perspective on
inference-making: interval, as opposed to point, estimation using confidence intervals.
Be "confident" about your enjoyment of this material.

B. Specific Objectives

9.1 Test the Difference Between Two Means

    9.1.1 The t test using independent samples

    9.1.2 The t test using dependent samples

9.2 Test the Significance of the Correlation Coefficient

9.3 Use Confidence Intervals for Parameter Estimation

9.1 Instructional Unit: Testing the Differences Between Two Means

The two hypothesis testing procedures you have learned thus far both
involve making inferences about a single population in terms of whether
its mean conforms to some parameter value. Thus, what basically takes
place is that a representative sample is selected and its mean is
computed. The mean is then compared to the hypothesized number value; e.g.,
H0: μ_X = 100, or H0: μ_X = 569, etc. If σ is known for the population,
the z statistic is used for the test; if σ is not known, the t statistic
is used.

But how many times will the researcher have an exact comparison value
in mind? Very rarely will that be the case, and typically only when he
is dealing with well-known measures for which standards (or norms) have
been derived. For example, if we are testing students in SAT scores we
can hypothesize that they will do better or worse than the national
average, which we know to be approximately 500. Similarly, we may wish
to compare the mean for a selected group of individuals to known
standards for such variables as height, weight, ACT scores, or body
temperatures, etc. In such instances, we can safely turn to the z or the t
procedures previously examined.






Most hypothesis tests, however, involve comparisons between treatments
that have never been used before in the exact same fashion. For example,
one may wish to compare the effects on achievement in a math class of
lecture instruction vs. programmed instruction. There would be no national
norms telling us what the achievement means should be using those
procedures, with those types of students, in that particular class, on
that particular achievement test, etc. What we are really asking is, would
the total population of students who receive lecture in that class
perform differently than the total population who receive programmed
instruction? If the answer is yes, then we have a basis for recommending
whichever method came out better to those who teach similar courses. But
can we test the total populations who would be eligible to receive those
methods? Definitely not, so as usual, we will use samples to make
inferences about the populations. The following sections will illustrate
two analytical designs for doing this: the t test for independent
samples and the t test for dependent samples.

9.1.1 The t Test Using Independent Samples

In this first case, we wish to compare two means obtained from
samples selected completely at random from their respective populations.
Who makes it into Sample #1 has no influence on who is selected for Sample
#2, and vice versa. We'll see in the next section (9.1.2) how a different
design involving "matching" subjects across samples can be employed.

Let's illustrate the independent samples design using the example
involving the comparison between lecture and programmed instruction.
We randomly select a sample of subjects to represent the total population
who might receive programmed instruction (PI). We do the same with
regard to lecture. Who is picked for PI has no influence on who is
picked for Lecture; it's the luck of the draw. Thus, one sample will be
given lecture, and the other programmed instruction. Here is how the
null and alternative hypotheses would be set up:



$$H_0: \mu_1 - \mu_2 = 0$$

$$H_A: \mu_1 - \mu_2 \neq 0$$




No need to panic about these, as they indicate exactly what we would
expect them to. The null ($H_0$) says that differences between the
population that receives lecture instruction and the population that
receives programmed instruction are equal to zero (no difference between
them!). The alternative ($H_A$) says that there are differences, meaning
that one population performs higher than the other. Based on the results
we obtain from our samples, we will either retain the null or accept the
alternative. The formula that is used to test these hypotheses (now
brace yourself) is as follows:

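$$t = \frac{\bar{X}_1 - \bar{X}_2}{\sqrt{\left[\dfrac{(n_1 - 1)\hat{s}_1^2 + (n_2 - 1)\hat{s}_2^2}{n_1 + n_2 - 2}\right]\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}}$$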



The numerator of this equation simply represents subtraction
of one sample mean from the other. That part is easy.

The denominator looks quite a bit more complicated, but the
mathematics involved, particularly with a calculator handy,
is really quite simple:




$\hat{s}_1^2$ is the unbiased variance estimate from
sample #1. Remember the variance from
Unit III? Unbiased means that n - 1 is used
in its derivation rather than n. Check
back to Unit III if all this seems mysterious.

$\hat{s}_2^2$ is the unbiased variance estimate from
sample #2. Remember, we are using samples
to make inferences about two populations.
Thus, variance estimates are needed from
both samples, rather than from only one as
was the case the first time the t statistic
was used.

$n_1$ and $n_2$ are the sizes of samples 1 and 2.

It would be beyond the scope of this text to try to
explain how the above formula is derived (hope you're not too
upset...). Perhaps it would suffice to say the following:
If there was actually no difference between the effects of
the two treatments, and samples of sizes $n_1$ and $n_2$ were
selected from the populations receiving the treatments, the
Expected Value of the difference between samples would be
zero. But, note that just like in the case of the other
hypothesis tests we have covered, some of the pairs of samples
would show a difference due to sampling fluctuation. When
all difference scores (from an infinite number of sample
pairs) are each divided by the error term shown in the
denominator above, the resultant values will be distributed as t
with df = $n_1 + n_2 - 2$. The t probability table for that
particular df shows the likelihood of obtaining certain t values
(under the assumption that the null is true). The logic
hasn't changed from that underlying the two hypothesis tests
discussed earlier. As an example, suppose that the sample
size for the programmed instruction treatment is n = 33, while
that for the lecture treatment is n = 29. To compute df for
the hypothesis test, we apply the formula $n_1 + n_2 - 2$
to get 33 + 29 - 2 = 60. Looking at the t table, we find that
if the null was true (i.e., there are no differences in
effectiveness between programmed instruction and lecture for the
"population"), the chances of getting a t ratio as extreme as
±2.00 would be 5/100; for a t ratio as extreme as ±2.66,
they would be 1/100; and so on (make sure that you can see
where these estimates come from on the table). So, once
again we are asked to obtain a sufficiently "extreme" outcome
in order to justify rejection of the null in favor of the
conclusion that the treatments (probably) do differ in
effectiveness.
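The sampling logic just described can be checked with a small simulation. The sketch below is only an illustration, not part of the original text: it draws many pairs of samples (sizes 33 and 29) from one and the same normal population, so the null is true by construction, and counts how often the t ratio is as extreme as ±2.00. The population mean of 100, the standard deviation of 15, the number of trials, and the function names are all assumptions made for the example.

```python
import math
import random


def pooled_t(sample1, sample2):
    """Independent-samples t ratio using the pooled error term."""
    n1, n2 = len(sample1), len(sample2)
    mean1, mean2 = sum(sample1) / n1, sum(sample2) / n2
    # Sum of squared deviations; this equals (n - 1) times the unbiased variance.
    ss1 = sum((x - mean1) ** 2 for x in sample1)
    ss2 = sum((x - mean2) ** 2 for x in sample2)
    pooled_variance = (ss1 + ss2) / (n1 + n2 - 2)
    error_term = math.sqrt(pooled_variance * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / error_term


def share_of_extreme_t(trials=4000, n1=33, n2=29, cutoff=2.00, seed=1):
    """Fraction of sample pairs with |t| >= cutoff when the null is true."""
    rng = random.Random(seed)
    extreme = 0
    for _ in range(trials):
        # Both samples come from the SAME (assumed) population: mean 100, SD 15.
        a = [rng.gauss(100, 15) for _ in range(n1)]
        b = [rng.gauss(100, 15) for _ in range(n2)]
        if abs(pooled_t(a, b)) >= cutoff:
            extreme += 1
    return extreme / trials


print(share_of_extreme_t())  # expected to land near .05 for df = 60
```

With a finite number of trials the proportion will wobble a little around 5/100, which is itself a small demonstration of sampling fluctuation.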

Confused? That's very possible, as perhaps we've made
a very simple procedure sound rather mysterious and complex.
Let's try to clean up any misconceptions by considering a
concrete example:

Example

A researcher believes that for 10th grade math students
in Fly High, programmed instruction will result in better
end-of-year test scores than will lecture instruction. Mrs.
Smith's third period section, having an enrollment of 55
students, is randomly assigned to the two treatments,
resulting in 27 students being taught by programmed instruction
and 28 by lecture instruction. At the end of the year, the
same examination is given to both groups to enable the
comparison to be made between methods. The results show the
following:

              n     $\bar{X}$    $\hat{s}^2$

Programmed    27    90           26

Lecture       28    84           32

Before going on, make sure you understand what would be
involved in putting the above table together. The derivation
of n should be obvious, but if, as a result of some "memory
problem," you have forgotten what $\bar{X}$ and $\hat{s}^2$ are, make sure
that you look them up (Units II and III, respectively); also
remember that $\hat{s}^2$ is derived by using n - 1 in the denominator
of the variance formula; it is used to estimate $\sigma^2$.

O.K., back to the problem at hand. The results look
suggestive, but can the researcher conclude that programmed
instruction was significantly more effective than lecture
instruction? First, of course, we need a significance level
for the test. Sticking with convention, we'll say that the
researcher is content to go with $\alpha = .05$. The next step is
to compute a t score and determine whether it falls within
or outside the ".05" rejection region for df = 28 + 27 - 2 = 53.
We'll call the Programmed Instruction sample #1 and the
lecture sample #2 (which labels are used for the two samples
doesn't affect the workings of the formula). Substituting
the obtained values in the t formula shown earlier gives
the following:



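$$
\begin{aligned}
t &= \frac{90 - 84}{\sqrt{\left[\dfrac{(27-1)(26) + (28-1)(32)}{27 + 28 - 2}\right]\left(\dfrac{1}{27} + \dfrac{1}{28}\right)}} \\
  &= \frac{6}{\sqrt{\left[\dfrac{(26)(26) + (27)(32)}{53}\right]\left(\dfrac{1}{27} + \dfrac{1}{28}\right)}} \\
  &= \frac{6}{\sqrt{\left[\dfrac{676 + 864}{53}\right]\left(\dfrac{1}{27} + \dfrac{1}{28}\right)}} \\
  &= \frac{6}{\sqrt{29.06\left(\dfrac{1}{27} + \dfrac{1}{28}\right)}} \\
  &= \frac{6}{\sqrt{\dfrac{29.06}{27} + \dfrac{29.06}{28}}} \\
  &= \frac{6}{1.46} \\
  &= 4.11
\end{aligned}
$$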



Before you continue further, make sure that you know
where the numbers that were used in the very first
expression came from. Check them with the results of the study
listed in the table above and with the original formula
shown earlier. The different expressions that
appear above represent a step-by-step process of
simplifying the original. When we are all done, we get t = 4.11
for the comparison of the means obtained in the two
treatments.
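The same arithmetic can be sketched in a few lines of Python, working directly from the summary values in the table (n, mean, and unbiased variance for each sample). The function name `independent_t` is an assumption made for this illustration. Carrying full precision gives a t of about 4.13 rather than 4.11; the small gap appears to come from rounding the denominator to 1.46 in the hand calculation, and it does not change the decision.

```python
import math


def independent_t(mean1, var1, n1, mean2, var2, n2):
    """Return (t, df) for two independent samples from summary statistics.

    var1 and var2 are the unbiased variance estimates (n - 1 denominators).
    """
    df = n1 + n2 - 2
    pooled_variance = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    error_term = math.sqrt(pooled_variance * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / error_term, df


# Programmed instruction (sample 1) vs. lecture (sample 2), from the table.
t, df = independent_t(90, 26, 27, 84, 32, 28)
print(round(t, 2), df)  # 4.13 53
```

Swapping the two samples only flips the sign of t, which matches the remark that the labels do not affect the workings of the formula.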



The next step is to compare this value with the table
listing for $\alpha = .05$, df = 53. The t table contained in
the Appendix does not list any values for df = 53, so to
be conservative (i.e., avoid elevating the chances of a
Type I error above 5%), we use the next available lowest
df, which on that table is 40. (If our computed t value was very
close to the tabled listings, we could get a more accurate
critical value by using a more detailed table or by
interpolating.) Looking at the values for df = 40, we
find that a t greater than +2.02 (or less than -2.02) is
needed to reject at the .05 level of significance. The
computed t therefore falls in the region of rejection. The
researcher will conclude that programmed instruction yielded
significantly higher test scores than did lecture
instruction. What are the chances that a Type I error was made?
Give yourself a "gold star" if you were thinking 5%. What
are the chances that a Type II error was made? Give
yourself another gold star if you were thinking that such a
question is ridiculous. Type II's can only be made when the
null is retained.

Here's another example:

Mrs. Goldfingers, a school nurse, is into the literature
on health foods. She decides on the basis of her reading
that having "Crunchy Cornflakes" for breakfast will give kids
"go power" in their physical education classes. She
convinces the principal to allow her to conduct an experiment
using third-grade classes. The experiment involves serving
"Crunchy Cornflakes" for breakfast to half of the students
in each third grade class and the normal scrambled egg dish
to the other half. The significance level is established at
$\alpha = .01$. The students are then tested on the number of
push-ups they can do in their physical education class that day.
When the experiment is completed, the results obtained are as
follows:

              n     $\bar{X}$    $\hat{s}^2$

Cornflakes    50    10           9

Eggs          50    9            12

Can it be concluded that cornflakes were more effective than
eggs?



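$$
\begin{aligned}
t &= \frac{10 - 9}{\sqrt{\left[\dfrac{(50-1)(9) + (50-1)(12)}{50 + 50 - 2}\right]\left(\dfrac{1}{50} + \dfrac{1}{50}\right)}} \\
  &= \frac{1}{\sqrt{\left[\dfrac{441 + 588}{98}\right]\left(\dfrac{2}{50}\right)}} \\
  &= \frac{1}{\sqrt{.42}} \\
  &= 1.54
\end{aligned}
$$

Note: since the n's are equal, we can add the multipliers
in the denominator.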




Having +1.54 as the computed t value, what is the decision?
To find degrees of freedom, we apply the simple formula
50 + 50 - 2 = 98. The t table has no listing for df = 98, so
we must move down to the row associated with df = 60. Moving
across the columns, we eventually reach the .01 significance
level, which lists the critical t value as 2.660. The
decision, then, is to retain the null. The experiment is
inconclusive.
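The "move down to the next lowest tabled df" rule used in both examples can be sketched as a small lookup. The table below is an assumption for illustration: it holds standard two-tailed critical t values for a handful of df rows, and the Appendix table in the book may list different rows or more decimal places. The names `T_TABLE`, `conservative_critical_t`, and `decide` are likewise hypothetical.

```python
# Two-tailed critical t values for alpha = .05 and .01 (standard values;
# assumed here, since the book's Appendix table is not reproduced).
T_TABLE = {
    30: {0.05: 2.042, 0.01: 2.750},
    40: {0.05: 2.021, 0.01: 2.704},
    60: {0.05: 2.000, 0.01: 2.660},
    120: {0.05: 1.980, 0.01: 2.617},
}


def conservative_critical_t(df, alpha):
    """Use the largest tabled df that does not exceed the actual df."""
    usable = [row for row in T_TABLE if row <= df]
    if not usable:
        raise ValueError("df is below the smallest row in this small table")
    return T_TABLE[max(usable)][alpha]


def decide(t, df, alpha):
    """Compare |t| with the conservative critical value."""
    critical = conservative_critical_t(df, alpha)
    return "reject the null" if abs(t) > critical else "retain the null"


print(decide(4.11, 53, 0.05))  # programmed instruction vs. lecture
print(decide(1.54, 98, 0.01))  # cornflakes vs. eggs
```

Rounding df downward raises the critical value slightly, so the test can only become harder to pass, which is what keeps the Type I error rate at or below the chosen level.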






9.1.2 The t Test Using Dependent Samples

The type of experiment illustrated in the previous section
involved comparing treatments by applying them and analyzing
their outcomes using independent samples selected from the same
overall population. For example, one group might receive
programmed instruction and the other lecture instruction, but
there would be no systematic relationship between the members
of the two groups. In this section, we will discuss a slightly
different type of experiment whose uniqueness lies in its
matching of subjects across treatments. For instance, in the
case of the programmed instruction vs. lecture experiment
considered earlier, one might argue that an intervening variable
such as IQ might be so powerful that any differences between
the two treatments would be "washed-out." The specific concern
might be that, by chance, even though assignment to treatments
was done on a random basis, one group would be higher in IQ
than the other. Regardless of treatment, then, that group
might show the higher performances. To eliminate this
possibility, the matched group design to be discussed here would
systematically assign subjects to treatments such that for
each "high IQ" student assigned to Treatment A there would be
a matched, high IQ student assigned to B, etc. There would be
a number of different ways of doing this, but obviously, all
would involve first measuring the prospective participants in
terms of IQ.

The "dependent samples" design can be advantageous in
many instances. The most important criterion affecting the
desirability of its use is the relation of the "matching"
variable to the outcome variable. With regard to the above
example, it should be obvious that under normal circumstances
IQ would relate significantly to the outcome variable in
question, i.e., achievement on the lesson. Thus, IQ could be
considered as a possible matching variable. But, would it do
any good to match subjects on the basis of hair color, such
that there would be a blond for a blond, etc.? Given that
there would probably be no theoretical basis for predicting
a relationship to exist between hair color (matching variable)
and achievement (outcome variable), one would be doing a lot
of extra work for nothing. Thus, the rule is that the stronger
the relationship between the matching and outcome variables,
the stronger the justification for using a dependent samples
design. When the relationship is strong, the payoff is a
more powerful test of treatment differences, which is the
whole purpose of doing the experiment in the first place
(i.e., comparing treatments or conditions). The cost, however,
is additional work (it's much easier to assign subjects
randomly to independent samples) and a cost of roughly half of
the degrees of freedom (df) that would be present were
independent samples to be used. The latter "cost" will become evident
when you work with the formula to be provided later. But, if
you'll look at the t table now, you will see that the greater
the df for any $\alpha$ level, the lower the critical value that is
needed to reject the null. Thus, a loss in df, without any
compensation, would hardly be to the experimenter's advantage.

Before moving to the formula for the dependent samples
design, test yourself on your understanding of the matching idea
by identifying each of the following experiments as either
justified or unjustified in terms of the probable relationship
between its proposed matching and outcome variables (cover
the answers as you go along).



Example A:

Treatment: Watching films vs. Active Playing
Outcome Variable: Shooting ability in basketball
Matching Variable: Height

Justified or not?

Ans: This dependent samples design would probably be justified.
Height, in most cases, would have a reasonably strong
relationship with shooting ability. Thus, a stronger
hypothesis test would most likely result from equalizing
the two treatment groups such that a tall person in one
is matched to a tall person in the other, etc.



Example B:

Treatment: Watching films vs. Active Playing
Outcome Variable: Knowledge of rules of basketball
Matching Variable: Height

Justified or not?

Ans: Here the use of the "dependent" design would be
questionable. Height should not show much of a relationship with
knowledge of rules following the treatments. One would
be going through some extra work and throwing away df,
without earning much of a pay-off.



Example C:

Treatment: Aspirin "A" vs. Placebo
Outcome Variable: Frequency of headaches
Matching Variable: Amount of education

Justified or not?

Ans: Probably not. While one could perhaps argue that college
grads in "white-collar" positions might tend to have more
headaches than high school grads in "blue collar"
positions, it seems doubtful that such a relationship, assuming
it really exists, would be very strong. Matching
participants in terms of education would probably be wasted
effort in this study, as random sampling would probably
achieve a reasonable enough balance on the education dimension.



Example D:

Treatment: Aspirin "A" vs. Placebo
Outcome Variable: Blood Pressure
Matching Variable: Age

Justified or not?

Ans: Such matching would well be worth some consideration, as
age should show a fairly consistent correlation with
blood pressure. If, by chance, random sampling resulted
in one group being older than the other, one could obtain
differences in this experiment that do not reflect the
actual effects of the aspirin vs. the placebo.



Example E:

Treatment: Film watching vs. Reading
Outcome Variable: Score on a history exam
Matching Variable: Grade point average

Justified or not?

Ans: Such matching would seem justified, as grade point
average would probably show a fairly strong relationship
with examination scores.



HAD ENOUGH? Let's move on to the formula for the dependent
samples design. Here's how to compute t:

$$t = \frac{\bar{D}}{\sqrt{\dfrac{\sum d^2}{n(n-1)}}}$$

where

$$\sum d^2 = \sum D^2 - \frac{\left(\sum D\right)^2}{n}$$



DON'T PANIC! This formula is actually fairly easy to work
with once you know the meaning of "D." Here are some
definitions:

D: is the difference between the scores of a
subject in Treatment-1 and his/her matched
counterpart in Treatment-2. This variable should become
clearer in the example to be presented below, but
for now imagine that John has been matched to
Sally in IQ, and each receives a different
instructional treatment. John obtains a score of
80 on the criterion measure (the final exam),
while Sally obtains a score of 75.
For this particular matched pair, D = 5
(or -5, depending on which treatment is listed
first).

$\sum D^2$: is the sum of the squared difference scores
(D's) across all matched pairs.

$(\sum D)^2$: is the square of the sum of all difference
scores (D's).

$\sum d^2$: is needed for the error term; it is obtained by
the second of the two formulas above.

$\bar{D}$: is the average difference between matched pairs.

n: is the number of matched pairs.

If you have a good memory, you might recall that the
formula for obtaining $\sum d^2$ is operationally identical to that used
for calculating the variance, as was shown in Unit III. No
surprise; what we are doing here is the same thing, except
we are getting a variance estimate for "difference scores,"
rather than for plain old X's.

So, at the risk of getting you confused with "words,"
use of the formula involves computing t by:

(a) determining the difference between the scores
of members of each matched pair. The result
is a series of D's, one for each pair.

(b) adding all of these D's and dividing by the total
number of pairs to get an average. This is
labeled as $\bar{D}$, and it goes in the numerator of the
formula.

(c) manipulating the D's as shown in the second
formula to get $\sum d^2$. This value is then divided
by n(n - 1), and the square root of the result
then provides the denominator of the formula.

(d) dividing numerator by denominator gives us t.

(e) the computed t, as in all other tests we've
covered, is then compared to the critical t
for the appropriate $\alpha$ and df.

IN THE DEPENDENT SAMPLES ANALYSIS, df = n - 1,
where n = number of matched pairs.

Now for an example:

Example A

Let's work with the same experiment outlined in an earlier
example. A basketball coach receives an official "NBA"
training film that is supposed to teach "all one needs to know"
about shooting techniques. He wonders whether showing the film
will have much effect on the performance of his 9th grade
students. He decides to do an experiment in which half of the 20
students in his 11:00 section will watch the film while the other
half will spend the same amount of time in "free play." Not
wanting to take the chance that height could be a factor, he
matches each student in the film condition to a student of
similar height in the "free play" condition. During the next
period, he gives the students a "free throw" test in which
they are allowed 10 chances at the foul line. Results are as
follows:

        Film:              Play:
Pair    Treatment 1        Treatment 2       $D = X_1 - X_2$    $D^2$

 1      Rick      8        Bob      10             -2              4
 2      Mike      8        Fred      8              0              0
 3      Tom       4        Alex      8             -4             16
 4      Robby    10        Mick      9              1              1
 5      Steve     2        Carl     10             -8             64
 6      Don       7        Roger     6              1              1
 7      Tom A.    6        Bill     10             -4             16
 8      Jim       7        Marc      9             -2              4
 9      Buster    5        Fran      5              0              0
10      Pete      0        Ed        4             -4             16

$\sum D = -22$          $\sum D^2 = 122$
$\bar{D} = -2.2$

Before continuing, make sure you understand that members of a
given pair were matched in this experiment according to height.
Each then received a different treatment, the score under which
is shown adjacent to the name. Most important, make sure that
you understand how the D's were derived, as this variable is the
one required to get the analysis started.



First, we need to get the value that goes in the error
term (denominator of the main equation). Substituting values
in the equation, we get:

$$\sum d^2 = 122 - \frac{(-22)^2}{10} = 122 - 48.4 = 73.6$$

Now we are ready for the main equation. The numerator, $\bar{D}$, is
the average difference between pair members. As computed
above it equals -2.2.

$$t = \frac{-2.2}{\sqrt{\dfrac{73.6}{10(9)}}} = \frac{-2.2}{\sqrt{\dfrac{73.6}{90}}} = \frac{-2.2}{\sqrt{.818}} = \frac{-2.2}{.904} = -2.43$$



If the significance level being applied was $\alpha = .05$,
what would the decision be?

Remembering that df in the dependent samples design is
equal to n - 1 (where n represents the number of pairs),
we get 10 - 1 = 9 degrees of freedom. The t table shows
that the critical value for this df is 2.262. We,
therefore, have a sufficient basis for rejecting the null, but
because the computed t had a negative sign, we want to be
sure to conclude that Treatment-2 ("Play") was favored over
Treatment-1 ("Film"). (Note that the positioning of
treatments was arbitrary; we could have made "Play" the first
treatment just as easily, in which case the computed value
would have had a "+" sign.)

If the significance level was $\alpha = .01$, what would the
decision be?

RETAIN THE NULL, as the critical value for df = 9, t =
3.25, exceeds the computed value.

The possibility of reaching these opposite conclusions
should reconfirm in your mind the importance of the selected
$\alpha$ level to the interpretation of research findings.
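Steps (a) through (d) of the dependent samples procedure can be sketched in Python using the free-throw scores from the example. This is a minimal illustration, and the function name `dependent_t` is an assumption. The Play scores for pairs 2 and 3 are taken as 8 and 8, the values implied by the listed differences of 0 and -4.

```python
import math

film = [8, 8, 4, 10, 2, 7, 6, 7, 5, 0]    # Treatment 1
play = [10, 8, 8, 9, 10, 6, 10, 9, 5, 4]  # Treatment 2, matched by height


def dependent_t(scores1, scores2):
    """Return (t, df) for matched pairs using the difference-score formula."""
    # (a) one difference score D per matched pair
    diffs = [x1 - x2 for x1, x2 in zip(scores1, scores2)]
    n = len(diffs)
    # (b) the average difference goes in the numerator
    mean_diff = sum(diffs) / n
    # (c) sum of squared deviations: sum(D^2) - (sum(D))^2 / n
    sum_d2 = sum(d * d for d in diffs) - sum(diffs) ** 2 / n
    error_term = math.sqrt(sum_d2 / (n * (n - 1)))
    # (d) numerator divided by denominator; df is pairs minus one
    return mean_diff / error_term, n - 1


t, df = dependent_t(film, play)
print(round(t, 2), df)  # -2.43 9
```

Passing `play` first and `film` second gives +2.43, mirroring the point that the ordering of the treatments is arbitrary.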

Given the formula for computing t in a dependent samples
analysis, there is really nothing much new to learn here.
Thus, we'll go immediately to a practice posttest, which will
test your skill at doing the analysis on your own.



9.1 POSTTEST (answers in back)

1. A new sedative is developed and tested in terms of its effects on
reducing pulse rate. A random sample of 30 adults is administered
the drug, whereas another random sample is administered a placebo.
Pulse rate of the two groups is then analyzed. Results are

              n     $\bar{X}$    $\hat{s}^2$

Drug          30    79           100

Placebo       30    91           95

a. What is the computed t value?

b. What is the critical t value for $\alpha = .01$?

c. Should the null be rejected or retained?

d. What type of error is possible?


2. A study is planned in which the effects of a new vitamin compound
will be compared against those of a placebo on an incidental outcome:
the growth of new facial hair. Given the nature of the treatment and
outcome variables, which of the following would be the best choice
as a matching variable for a dependent samples design? Why?

a. IQ          c. height               e. popularity

b. sex         d. physical fitness

3. Attitudes toward busing are analyzed as a function of whether adults
live in a large city or in a small one. Given the nature of the
treatment (city size) and outcome variable (attitudes), which of the
following would be the best choice as a matching variable? Why?

a. height      c. physical fitness     e. popularity

b. sex         d. race

4. Analyze the data obtained in the following experiment. Students in
a fourth-grade class are matched in IQ. One member of each matched
pair receives written teacher comments ("Good going," "Nice work,"
"Try better next time," etc.) on his/her returned answer sheet from
Arithmetic Test #1. The other member receives no comments.
Comparisons between groups are then made with regard to performance on Test
#2. The .05 level of significance is selected for the evaluation of
results. Results are as follows:

                    Test Scores
        Student Receiving    Student Receiving
Pair    Comments             No Comments

 1       80                   75
 2       98                   90
 3       75                   77
 4       83                   77
 5      100                   88
 6       63                   65
 7       79                   60
 8       94                   93
 9       90                   81
10       70                   66

a. What is the computed t?

b. What is the critical t for $\alpha = .05$?

c. What is the decision?

d. What type of error is possible?



9.2 Instructional Unit: Testing the Significance of the Correlation
Coefficient

In this section, we will illustrate how hypothesis testing can be
applied to statistics other than means ($\bar{X}$'s). At the same time, you will
learn about a very valuable and commonly used analytical technique: that
for testing the significance of the correlation coefficient.

Example Situation

Dr. Wynn Flynn, a noted clothing researcher, believes that there is
a significant relationship between the length of people's toenails and
the amount of wear they get from socks. To test this (using a $1,000,000
grant from the American Toenail Association) he selects a random sample
of 100 adult males. He measures the length of each subject's toenail
(right foot, big toe) and the amount of wear obtained from a brand new
pair of dress socks (number of days before a hole appears). (Out of
consideration for the subjects' friends and families, each subject is
required to wash the socks each night.) The correlation obtained is
r = -.47. Can you interpret this? Shame if you cannot and you're this
far along in the book!

Dr. Flynn now wants to publish his findings to support the conclusion
that longer toenails are associated with less wear from socks. Is that
conclusion justified?

Well, the negative correlation obtained does support that trend, BUT
any result is possible from a sample. How do we know that this result
represents what would happen for an entire population of adult males
possessing big toes and similar types of socks?

Answer: We don't! Just like in the case of means ($\bar{X}$'s), all we can
do is make probabilistic judgments based on the results obtained and
statistical theory. We have a null hypothesis here too. It essentially
says "no difference, no effect (no correlation!)" for the population.
If the weight of evidence makes that hypothesis appear improbable, the
conclusion is that "a difference or effect (a correlation!)" does seem
likely for the population. Let's illustrate the hypothesis testing
procedure, using Dr. Flynn's results as an example:

1. Hypotheses:

$$H_0: \rho_{xy} = 0$$

$$H_A: \rho_{xy} \neq 0$$

The null establishes a condition in which there is no correlation
($\rho$) between X (toenail length) and Y (sock wear) for the population. If
evidence favors "rejection" of the null as improbable, we are left with
the alternative, that there is a correlation.




2. Random sampling and data collection. These go without saying.
Obtain your random sample, as Dr. Flynn did, and collect your data.

3. Compute statistic and test $H_0$. Since our null concerned
correlation, we naturally want to compute the sample correlation r. Dr.
Flynn's r was -.47. Is that a sufficient finding to view the null
(which states no correlation) as improbable? Just like for our tests
involving means, we need a formula:

$$t = \frac{r}{\sqrt{(1 - r^2)/(n - 2)}}, \qquad df = n - 2$$



Yes, when $\rho_{xy} = 0$, the above outcome is distributed as t. It thus
provides a basis for concluding whether the r we obtained is improbable
if the population correlation ($\rho_{xy}$) is actually zero. If so, we reject
the null! Note that df for the distribution is n - 2 (number of subjects
minus 2). Let's work with Dr. Flynn's results, using .01 as our significance
level:

$$t = \frac{-.47}{\sqrt{(1 - (-.47)^2)/(100 - 2)}} = \frac{-.47}{\sqrt{.779/98}} = \frac{-.47}{.089} = -5.28$$

Now check the t table for df = 98 at $\alpha = .01$.

4. Conclusion. The computed t value of -5.28 clearly exceeds (in
absolute value) the tabled one of about 2.66. The sample correlation
appears "significant," meaning, "reject the null!" We can infer that a
negative correlation between toenail size and sock wear probably exists
for the population. Any chance of a Type I error? Yes, 1% (assuming the
null were actually true). Any chance of a Type II error? None! We
rejected the null!
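The test statistic for a correlation needs only r and n, so it is easy to sketch in Python. The function name `correlation_t` is an assumption made for this illustration. At full precision the result for Dr. Flynn's data is about -5.27; the -5.28 in the hand calculation appears to come from rounding the denominator to .089.

```python
import math


def correlation_t(r, n):
    """Return (t, df) for testing H0: rho = 0 from a sample correlation r."""
    df = n - 2
    return r / math.sqrt((1 - r ** 2) / df), df


# Dr. Flynn's toenail study: r = -.47 from n = 100 subjects.
t, df = correlation_t(-0.47, 100)
print(round(t, 2), df)  # -5.27 98
```

Because n - 2 sits in the denominator under the square root, the same r produces a larger t as the sample grows, which is why a modest correlation can be significant in a large sample.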

Another Example

Dr. Flynn suspects that there will be a significant negative
correlation between head size and time taken to put on sweaters. He
employs a sample of 25 women for this, and tests them using appropriate
materials and measures. The resultant correlation is r = -.14.

Flynn's feeling pretty good about himself, since the result was
just the way he called it: a negative correlation. His head is
getting so big that the sweater he's wearing may never come off.
But we know about sampling fluctuation. Maybe his result is quite
reasonable as a product of sampling fluctuation. Maybe had he
tested everyone, the correlation would have been zero! Let's do
the test (using $\alpha = .05$).



$$t = \frac{-.14}{\sqrt{(1 - (-.14)^2)/(25 - 2)}} = \frac{-.14}{\sqrt{.98/23}} = -.678$$



With df = 23, the critical region for rejecting the null is defined
(see t table) as ±2.069 for $\alpha = .05$. Our computed t value was only
-.678. There is not sufficient evidence for rejecting the null. It
could well be the case that were all women to be tested, head size would
have no relationship to "sweater putting on" time. Flynn is foiled by
his foolish forecast.



POSTTEST (answers in back)

1. A researcher correlates two variables using a sample of n = 30. The
obtained r = .36. He wishes to test its significance (relative to
$\rho = 0$) using $\alpha = .01$.

a. What are the null and alternative hypotheses?

b. What is the tabled t value?

c. What is the computed t value?

d. What is the decision?

e. What type of error (I or II) could have been made?

2. Answer the same questions as above (a-e) given r = .78, n = 16,
$\alpha = .05$.



9.3 Instructional Unit: Using Confidence Intervals for Parameter Estimation

The analytical techniques discussed in the two previous sections
are used to test hypotheses regarding one or more populations. Example
questions: Is Treatment A better than Treatment B (for all people
receiving each)? Is there a positive correlation between X and Y for
all people in Population A? Again, the formulas allow us to test
specific hypotheses. But suppose that we have no hypotheses in mind and
instead simply want to estimate a population parameter that is unknown?
Maybe you are a real estate salesman and want to estimate the square
footage of a typical house in your community. Or, maybe you're a college
instructor and want to know how much time a typical student devotes to
studying. No hypotheses or no actual figures may exist for either of
these concerns. But the notions of Sampling Distributions, Expected
Values, and Standard Errors can still be applied to allow us to estimate
the "answers."

Point versus Interval Estimation 

A point estimate of a population parameter, let's say $\mu_X$, uses a
single value as the basis for estimating the unknown. The real estate
salesman in the above example simply selects a random sample of houses
and measures their square footage. Let's pretend that he obtains
$\bar{X}$ = 14,962. That single value then becomes the basis for estimating
$\mu_X$. Using the same logic, the college professor surveys a random sample
of students on the amount of time they devote to studying. He obtains
$\bar{X}$ = 96.34 min. a night, which becomes his estimate of $\mu_X$.

Point estimates are nice and convenient. But, based on your knowledge
of sampling fluctuation, can you see any problem? Sure.... If you happen
to get a weird sample, the impression you obtain (and maybe rely on)
could be horribly misleading. This concern creates a rationale for a
different type of estimation, involving intervals.

Interval estimation specifies a range of values that includes the
unknown population parameter a certain percent of the time. The
conventional term statisticians use in referring to these ranges is
confidence intervals. The level of "confidence" expresses the
statistician's confidence that the true parameter value is contained in
the interval of values. For example, we might compute, using methods to
be described below, the 95 percent confidence interval for the housing
square footage $\mu_X$ to be from 13,600 to 14,400. We can say "with 95%
confidence" that the actual population mean lies somewhere within this
range. Let's examine more closely how confidence intervals are
constructed. For simplicity and clarity, we'll restrict our examples to
instances involving means (confidence intervals, though, can be applied
in the identical manner to other statistics as well).

Constructing Confidence Intervals 

Let's suppose that scores on a standardized spelling test for
second graders are normally distributed with $\mu_X = 250$ and $\sigma = 50$.
You are unaware of these figures and select a random sample of 100 scores
to estimate $\mu_X$. Naturally, the basis for your estimate is $\bar{X}$, the
mean of your sample. It is highly unlikely that this point estimate will
be exactly equal to $\mu_X$, due to sampling fluctuation. But you could
construct an interval around $\bar{X}$ that will have a certain probability
(give you a certain degree of confidence) of containing $\mu_X$. A
conventional probability or confidence level is .95. What interval will
have a 95% chance of containing $\mu_X$?

Sampling theory provides the answer. Look below at the Sampling
Distribution for $\bar{X}$'s computed from all unique samples of n = 100 drawn
from that population. As reviewed in Unit VII, $E(\bar{X}) = \mu_X$ and
$\sigma_{\bar{X}} = \sigma/\sqrt{n}$. What portion of the distribution would
contain 95% of the samples? Answer: That between +1.96 and -1.96 standard
errors (z's). How do we know? Normal curve probability, of course!
Ninety-five percent of scores always fall between $\pm 1.96z$.



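[Figure: Sampling distribution of $\bar{X}$, with the z axis marked at
-1.96, 0, and +1.96 and the 2.5% tail beyond each limit shaded.]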
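For readers who would rather verify the ±1.96 figure than take the table's word for it, here is a minimal sketch using Python's standard `statistics.NormalDist` class. The helper name `central_area` is made up for illustration.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal curve: mean 0, standard deviation 1

def central_area(limit):
    """Proportion of the normal curve lying between -limit and +limit."""
    return z.cdf(limit) - z.cdf(-limit)

area = central_area(1.96)

# Working backwards: which z leaves 2.5% in the upper tail?
upper_limit = z.inv_cdf(0.975)

print(round(area, 3), round(upper_limit, 2))
```

The two numbers printed are the same fact read in opposite directions: the middle 95% of the curve ends at about 1.96 standard errors on either side of the mean.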



Now let's reason the same way, but (sort of) in reverse. Given that
we have computed a single $\bar{X}$, what interval around it is 95% likely to
contain $\mu_X$? Answer: $\bar{X} \pm 1.96\sigma_{\bar{X}}$. Think about it and
try these questions:

1. What percentage of sample $\bar{X}$'s would be within
   $\pm 1.96\sigma_{\bar{X}}$ from $\mu_X$?
   Answer: 95% (look at the figure)







2. So, if you added $1.96\sigma_{\bar{X}}$ to each $\bar{X}$ to form an upper
   limit and then subtracted the same value to form a lower limit, what
   percentage of those intervals (one for each sample) would
   contain $\mu_X$?
   Answer: 95%

3. Huh?
   Answer: Look at the Sampling Distribution. Isn't it true that of
           those on the left side, only the 2.5% in the shaded area
           would never reach $\mu_X$ with their upper limit? Isn't it
           also true that of those on the right side, only 2.5%
           would never reach $\mu_X$ with their lower limit?

4. What, then, is the 95% confidence interval for $\bar{X}$?
   Answer: $\bar{X} \pm 1.96\sigma_{\bar{X}}$

   Thus, if we add and subtract $1.96\sigma_{\bar{X}}$ to $\bar{X}$, we can be
   95% confident that the resultant interval will contain $\mu_X$. Or, to
   say it in a slightly different way, if we were to repeat the process
   for all sample $\bar{X}$'s, 95% of the confidence intervals would
   "capture" $\mu_X$, while 5% would not.

5. Would a 99% confidence interval be wider or narrower than a
   95% one?
   Answer: If you got this, you're really thinking! WIDER is the
           only choice! If you want to be more confident of
           capturing $\mu_X$, don't you have to cover more ground??

6. What would the 99% confidence interval be?
   Answer: $\bar{X} \pm 2.58\sigma_{\bar{X}}$

7. Answer: Isn't it true from normal curve probability that 99%
           of the scores fall within $\pm 2.58z$? By the same
           reasoning, if limits based on those dimensions were
           formed around the $\bar{X}$'s of all unique samples, 99% of
           the resultant intervals would contain $\mu_X$.

8. Can one give a general expression that would represent confidence
   intervals using normal curve probability (i.e., Z values)?
   Answer: Yes

9. Well, what is the expression?
   Answer: $\bar{X} \pm (z_{conf})\,\sigma_{\bar{X}}$, meaning: determine the z
           value corresponding to the particular level, multiply it by
           the standard error, add it to $\bar{X}$ for an upper limit and
           subtract it for a lower limit.

10. Suppose your computed sample $\bar{X}$ in the previous example (spelling
    scores) was 240. What is the 95% confidence interval? What is
    the 99% confidence interval?






Answer: For both levels, $\sigma_{\bar{X}} = \sigma/\sqrt{n} = 50/10 = 5$.

$Conf_{.95} = 240 \pm (1.96)(5)$

            = 230.2 (lower limit) and 249.8 (upper limit)

We can be 95% confident that this interval contains $\mu_X$. Note,
however, that this is one of the 5% that actually doesn't!! We
obtained an extreme sample ($\mu_X = 250$, see page 289).

$Conf_{.99} = 240 \pm (2.58)(5)$

            = 227.1 (lower limit) and 252.9 (upper limit)

We can be 99% confident that this interval contains $\mu_X$. (And it does.
See how the larger, 99% interval, can be more accurate than the
smaller, 95% interval? The trade-off, though, is that the 99% interval
gives a less precise estimation.)
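The spelling-score answer can be reproduced with a few lines of Python. This is only a sketch of the general expression from question 9; the function name `z_interval` is an illustrative choice, and the "true" mean of 250 is used here only because the example happens to reveal it.

```python
import math

def z_interval(sample_mean, sigma, n, z_conf):
    """Confidence interval for a mean when the population sigma is known."""
    standard_error = sigma / math.sqrt(n)
    margin = z_conf * standard_error
    return sample_mean - margin, sample_mean + margin

mu = 250  # true population mean; a researcher would not normally know this

interval_95 = z_interval(240, 50, 100, 1.96)
interval_99 = z_interval(240, 50, 100, 2.58)

for label, (low, high) in (("95%", interval_95), ("99%", interval_99)):
    captured = low <= mu <= high
    print(label, round(low, 1), round(high, 1), captured)
```

Running it shows the trade-off described above: the narrower 95% interval misses 250, while the wider 99% interval captures it.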

Using t as the Position Measure

In the above example, we had the luxury of trying to estimate $\mu_X$ using
known properties of the population it describes (that $\sigma = 50$). As was
discussed in Unit VIII, when we introduced the t distribution, it is
usually the case that $\sigma$ won't be known. So all we can do is to estimate
it using our sample standard deviation, $s$. But when Sampling
Distributions are constructed with $s$ in the standard error formula, they
conform to t rather than z probabilities. If you're not following all of
this, no need to worry. Maybe by examining the process, the underlying
theory will become a little clearer.

Suppose, for example, that you are interested in determining the
average number of daily phone calls received in your office. Your boss
supports you on this, but says that he will not permit you to spend time
surveying the calls every single day. So, you decide to randomly select
36 days -- the boss grants permission. You count the calls and compute
your statistics. The results: $\bar{X} = 16.5$ and $s = 4$.

Your interest naturally is with estimating the average for all days,
of which the 36 examined is only a very small sample. Using point
estimation, you would report 16.5 calls as typical -- but imagine the room
for error! Confidence intervals to the rescue... What is the 95% confidence
interval for our estimate? Since we are using $s$ rather than $\sigma$, the
"general" expression for such an interval is

$$\bar{X} \pm (t_{conf})\, s_{\bar{X}}$$

If you look back to the previous examples, you'll see that now we're
asking for (a) the t score, not the Z score, that corresponds to the
probability level desired, and (b) the standard error based on $s$, not $\sigma$.






Let's identify this mysterious confidence interval:

$\bar{X} = 16.5$

$s_{\bar{X}} = 4/\sqrt{36} = .667$

But what about $t_{conf}$?

The t distribution varies based on $df$. Our $df$, in this case, is
36 - 1 = 35. Looking at the t table, we find that $t_{.95}$ (or $t_{.05}$) = 2.042
for the closest $df$. So, we get:

$Conf_{.95} = 16.5 \pm (2.042)(.667)$

            = $16.5 \pm 1.36$

            = 15.14 (lower limit) and 17.86 (upper limit)

We can feel 95% confident that the actual mean falls within that interval.

What would the 99% confidence interval be? The only change here is that
we need the $t_{.99}$ value (same in table as $t_{.01}$), which is 2.75. Thus:

$Conf_{.99} = 16.5 \pm (2.75)(.667)$

            = $16.5 \pm 1.83$

            = 14.67 (lower) and 18.33 (upper)

We can feel 99% confident that the actual mean lies within that interval.
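The phone-call intervals can be sketched in Python as follows. The three table rows are standard two-tailed t values, but the table is only a small excerpt; the lookup rule (use the largest tabled $df$ that does not exceed the actual $df$) is an assumption about how "closest" was applied, chosen because it reproduces the 2.042 and 2.75 used above. All names are illustrative.

```python
import math

# Excerpt of a two-tailed t table: df -> (t for 95% level, t for 99% level).
T_TABLE = {30: (2.042, 2.750), 40: (2.021, 2.704), 60: (2.000, 2.660)}

def tabled_t(df, level):
    """Look up t using the largest tabled df not exceeding the actual df.

    Assumed rule; this excerpt only covers df of 30 or more.
    """
    row_df = max(d for d in T_TABLE if d <= df)
    t95, t99 = T_TABLE[row_df]
    return t95 if level == 95 else t99

def t_interval(sample_mean, s, n, level):
    """Confidence interval for a mean when sigma is estimated by s."""
    standard_error = s / math.sqrt(n)
    margin = tabled_t(n - 1, level) * standard_error
    return sample_mean - margin, sample_mean + margin

low95, high95 = t_interval(16.5, 4, 36, 95)
low99, high99 = t_interval(16.5, 4, 36, 99)
print(round(low95, 2), round(high95, 2))
print(round(low99, 2), round(high99, 2))
```

Statistical software would use the exact $df = 35$ and give a slightly smaller t, so its limits would be a touch narrower than the table-based ones.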

Can you see the relation between confidence intervals and hypothesis
testing through these examples? In hypothesis testing we compare our
sample mean to some hypothesized value; using confidence intervals, we
try to provide an estimation of the population mean using the information
we obtain from the sample. Both procedures employ the identical
information ($\bar{X}$, $s$, $n$, and t value) to produce the desired outcome.
For each hypothesis test we have presented, there is a corresponding
procedure for constructing confidence intervals.

Take, for example, the case where we compare two sample means using
independent samples (Unit 9.1.1). As introduced in this unit, the interest
was with testing the hypothesis that the two populations represented did
not differ from one another. On the other hand, we can simply be interested
in estimating what the actual difference, if any, is. To avoid burdening
ourselves with a whole new series of calculations, let's return to the
example on p. 274. What difference really exists between using programmed
instruction and lecture for that lesson? Our samples indicated a 6-point
advantage for programmed instruction over lecture. So, we could give that
as a point estimation -- and hope that luck is with us. Or, we can be more
cautious and construct a confidence interval around that value. What
would the 95% confidence interval be?






The general formula remains the same, but there are some values we
need to identify:

$Conf = (\bar{X}_1 - \bar{X}_2) \pm (t_{conf})\, s_{\bar{X}_1 - \bar{X}_2}$

$t_{conf} = t_{.95}$ for $df = 53$ = 2.02

$s_{\bar{X}_1 - \bar{X}_2}$ = computed by formula (see p. 275) = 1.46

$Conf_{.95} = 6 \pm (2.02)(1.46)$

            = 3.05 (lower) and 8.95 (upper)

We can be 95% confident that the "true" treatment difference lies
between 3.05 and 8.95 points.
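A small Python sketch can make the link between this interval and the corresponding hypothesis test concrete. It assumes, as is standard for a two-tailed test, that the interval and the test use the same standard error and the same tabled t; under that assumption a 95% interval that excludes zero goes hand in hand with rejecting the "no difference" null at $\alpha = .05$. The names are illustrative.

```python
def interval(estimate, critical_value, standard_error):
    """Lower and upper limits: estimate plus or minus critical value times SE."""
    margin = critical_value * standard_error
    return estimate - margin, estimate + margin

difference = 6         # observed difference between the two sample means
standard_error = 1.46  # standard error of the difference, from the text
t_conf = 2.02          # tabled t used in the text for the 95% level

low, high = interval(difference, t_conf, standard_error)

# The matching two-tailed test of "no difference" uses the same ingredients.
computed_t = difference / standard_error
zero_inside = low <= 0 <= high
rejects_null = abs(computed_t) >= t_conf

print(round(low, 2), round(high, 2), zero_inside, rejects_null)
```

Here zero lies outside the interval and the computed t (about 4.11) exceeds 2.02, so estimation and testing tell the same story.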



Summary

Using the procedures just illustrated, a confidence interval can be
constructed around any of the statistics we have examined. All that one
needs to know are the t (or Z) value and standard error corresponding to
the statistic's Sampling Distribution. If you really want to test your
understanding of the methods, see if you can identify the 95% confidence
interval for the statistic obtained in Example A (p. 283). The answer is
typed upside down at the bottom of this page.

Confidence intervals provide a way of estimating population parameters
using a "probable" range of scores. The more confident you want to be
about that range, the wider you must make it. Thus, a 99% confidence
interval will be wider than a 95% or 90% interval. The confidence
interval procedure contrasts with point estimation, which uses a single
value to represent the population. Though logically and functionally
related to hypothesis testing, confidence intervals are mainly concerned
with estimating, not comparing. Try to "estimate" your understanding of
these ideas by answering the questions below.



(jaddn) 9x*-puB (aawot) ^Z*^- 






9.3 POSTTEST (answers in back)

1. Differentiate between point estimation and interval estimation.

2. A 95% confidence interval is constructed around a sample mean. Which
   of the following is true?

   a. There is a 95% probability that the interval will contain $\mu_X$.

   b. If such intervals were constructed around all sample
      means, 95% of them would contain $\mu_X$.

   c. Such an interval will be smaller in size than if a 99%
      confidence interval was used.

   d. All are true.

3. Psychometrician Jones obtains a new "Leadership Abilities" test
   and wants to estimate what an average score would be for all
   people who might take it. After administering the test to a
   random sample of 36 people he obtains $\bar{X} = 68$ and $s = 12$. Report:

   a. the 95% confidence interval around $\bar{X}$;

   b. the 99% confidence interval; and

   c. the point estimate of $\mu_X$.






Unit IX Review Test
(answers in back)

A. Multiple-Choice

____ 1. A researcher rejects the null, when in reality the
        null is true. He/she has made

        a. a Type I error     c. both Type I and II errors
        b. a Type II error    d. a Type IV supreme error

____ 2. If you were doing an experiment for which you wanted
        to increase your chances of rejecting the null, which
        of the following procedures would be helpful?

        a. increase n (i.e., test more subjects)
        b. increase $\alpha$
        c. identify an intervening variable that strongly
           relates to the outcome variable, and use it for
           matching in a dependent samples design
        d. all of the above would help
        e. none of the above would help

____ 3. A medical researcher is doing a study involving a
        comparison between two surgery techniques. Naturally,
        it would be very dangerous to conclude that a particular
        technique is better than another, if it actually
        is not. In selecting a significance level, the researcher
        would be mostly concerned with

        a. minimizing the chances of a Type I error
        b. minimizing the chances of a Type II error
        c. maximizing the chances of a Type I error
        d. maximizing the chances of a Type II error

____ 4. A study is conducted in which subjects are assigned
        strictly at random to two treatments, and outcomes of
        the two treatments are then compared. The type of
        hypothesis test that would be appropriate here would be

        a. Z-test comparison with a numerical parameter value
        b. t-test comparison with a numerical parameter value
        c. t-test comparison for independent samples
        d. t-test comparison for dependent samples

____ 5. An experimenter wants to test the hypothesis that the
        GRE scores of a certain population of students are
        equivalent to the national average, which he knows to
        be 500 with $\sigma = 100$. Which one of the tests listed in
        the alternatives for item #4 would be the most appropriate?

____ 6. Fifty students are randomly assigned to Treatment A
        and 48 are randomly assigned to Treatment B. The appropriate
        $df$ for a comparison of means would be

        a. 48    d. 96
        b. 50    e. 97    g. 100
        c. 49    f. 47

____ 7. Fifty students in Treatment A are matched on some
        variable to 50 students in Treatment B. The appropriate
        $df$ for a comparison between means would be

        a. 48    d. 98
        b. 49    e. 99    g. none of these
        c. 50    f. 100

____ 8. An experimenter uses the dependent samples design, but
        selects a matching variable that has a zero correlation
        with the outcome variable in his experiment.
        Which of the following best describes the implications
        of what he has done?

        a. even though a "better" matching variable could
           have been selected, he has still strengthened his
           analysis by using the dependent samples approach
        b. aside from the extra effort involved in matching
           people, he will come out the same as if he had
           used the independent samples approach
        c. he will lose $df$ without achieving a stronger test;
           thus, the procedure would have been to his disadvantage
        d. the chances of making a Type I error will increase

____ 9. In the dependent samples analysis, $D$ refers to

        a. the difference between the means of Treatments A & B
        b. the standard deviation of a particular treatment
        c. the variance of a particular treatment
        d. the difference between the scores of members of
           a matched pair

____ 10. A study is performed in which the effects of drinking
         different quantities of alcohol are compared with respect
         to college seniors' ability to roller skate.
         If considering the possibility of using dependent
         samples, the most appropriate matching variable would
         be

         a. height    d. grade point average
         b. age       e. amount of prior skating experience
         c. IQ






B. A sample of $n = 25$ is tested, with results indicating $\bar{X} = 525$ and
   $s = 112$. Compute the 99% confidence interval and the 95% confidence
   interval around $\bar{X}$.






UNIT X

THE CHI-SQUARE TEST

A. General Objectives

The statistical procedures discussed thus far, with one exception,
are designed for use with interval or ratio scores -- scores indicated by
numbers on a continuum having equal distances between them. The
exception was the Spearman correlation (Unit V), which is applied to
ordinal or ranked data. But there are also many instances in which data
will be nominal or categorical. For example, is there an association
between people's membership in charitable organizations and how they vote
in elections? Are the city parks frequented more by one sex (or racial
group, or age group) than another? "Scores" here are essentially
frequency counts of different subpopulations. How would such nominal data
be analyzed? The purpose of this unit is to acquaint you with the uses
and computational procedures of a test designed specifically for that
purpose -- the chi-square test.

B. Specific Objectives

10.1 Identify the type of scores analyzed by the chi-square test

10.2 Use chi-square in a one-way test

     10.2.1 Test for equal frequencies

     10.2.2 Test goodness-of-fit

10.3 Use chi-square in a two-way test




10.1 Instructional Unit: Nominal scores and their analysis through chi-square

Scores indicating examination performances, outside temperatures,
yearly incomes, size of houses, populations of cities, and on and on,
all constitute interval measures or ratio measures. Each observation
can take only one value in a given assessment (John can have only one
score on the test), and can be ordered on a continuum with other
observations on the basis of that value (John's score was 87, which is
higher than Tom's 86). Further, the distances between the possible points
on the continuum are equal and have the same meanings (Tom's 1 point
deficit relative to John means the same as his 1 point advantage over
Ken's 85). Interval measures do not have a natural zero point (e.g., a
test score of zero usually does not indicate the total absence of
knowledge) while ratio measures do (a "money earned" score of zero means
exactly that -- a complete absence of earnings!!).

Then we can move from interval and ratio measures to the somewhat
less precise ordinal measures, which might be thought of as rankings.
Although ranks can be arranged in a numerical continuum, distances
between points on that continuum may be widely uneven. The person ranked
#1 might be far superior on the trait in question to #'s 2 and 3, who
are actually quite close to one another. Is the distance between 1 and
2 the same as that between 2 and 3? Not by a long shot.

Finally, we can move to an even less precise but just as important
type of measure -- nominal or categorical. Here we get neither a
meaningful numerical score nor a rank, but simply an indication of
membership or nonmembership in a particular category. Are church-goers
more likely to participate in local charity drives than those not
attending church? If we take a random sample of people participating in
local charity functions, and Glenda Mender is a subject, what will her
score be? Certainly not an interval or ratio score like a 97, or a rank
like a 28. The only possibility here is a nominal score, indicating to
which group, church or nonchurch, she belongs. When all subjects are
"tested" we will simply end up with a frequency count indicating the
number of observations (subjects) in each nominal category. On that
basis, we can do our comparison: 80 out of 100 were church-goers; 20 were
not. Interesting...
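To see how a list of nominal "scores" collapses into a frequency count, consider this minimal Python sketch. The raw data are hypothetical, built only to match the 80/20 split mentioned above, and the category labels are illustrative.

```python
from collections import Counter

# Hypothetical raw data: one nominal "score" (a category label) per subject.
scores = ["church"] * 80 + ["nonchurch"] * 20

# Tallying the labels is all the "scoring" that nominal data allow.
counts = Counter(scores)
total = sum(counts.values())

for category, frequency in counts.items():
    print(category, frequency, f"{frequency / total:.0%}")
```

Notice that nothing can be averaged or ranked here; the only numbers available are the counts themselves.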

A statistical test specifically designed to enable such comparisons
is the chi-square ($\chi^2$) test. As will be seen below, general categories
of uses pertain to cases in which there is one variable (are political
parties equally represented in the Senate?) and two variables (are
political parties and seniority related for members of the Senate?).
Again, what $\chi^2$ is looking at is nominal data -- the frequencies of
individuals who fall into different categories. It would allow us to make
a statistical judgment as to whether, for example, a count of 54 Democrats
and 46 Republicans does suggest unequal representation. Further, half
of the Democrats might have 10 or more years of seniority while only
one-third of the Republicans do. Is there a significant relationship
there? Chi-square would help us to find the answer.

10.1 POSTTEST (answers in back)

1. Differentiate between interval-ratio, ordinal, and nominal measures.
   To which is the chi-square test applied? Give an example of the type
   of research question to which it might be applied.







10.2 Instructional Unit: The one-way test

In this unit, we'll use the term one-way to describe applications
in which one variable (e.g., race, sex, blood type, school attended) is
being examined. Two related uses will be examined, one that tests for
equal frequencies and another that tests for goodness-of-fit. As will
be seen, the procedures for both tests are virtually identical. If you
can do one you can do both.

10.2.1 Testing for Equal Frequencies 

Suppose that at the monthly neighborhood school PTA meeting, we count
120 people in attendance. Further inquiry indicates that 60 of these
people are parents, 45 are teachers, and 15 are "others" (students,
administrators, maintenance people, etc.). Can we conclude that attendance
is evenly distributed across these three categories (parents, teachers,
or others)? Having come this far along in the book, you should be ashamed
for trying to "conclude" anything. These 120 people are only a sample
of all those who might attend such meetings. All that we can do is use
this sample as a basis for inferring what might be found if we could
characterize the total population of attenders. Just like we did in
other inference-making situations, we will state a null hypothesis, run
a statistical test, and then make an inference (not a firm conclusion)
on that basis.

Since we're dealing with nominal data (frequencies), our test is $\chi^2$.
But what would our null hypothesis be? For these types of analyses, the
following general statements can be used.

$H_0: O = E$

$H_a: O \neq E$

Translation: The null hypothesis says "What is observed (O) will be the
same as what is expected (E) -- nothing unusual!" The alternative
hypothesis, of course, says the opposite. "But what is expected?" you
might ask. Well, since the null refers to the "no difference" condition,
we'd expect in this study that out of 120 people attending PTA meetings,
one-third (n = 40) would be teachers, one-third parents, and one-third
"others" -- an even distribution. Let's run our test to see whether the
evidence seems consistent with that condition. We need a formula:

$$\chi^2 = \sum \frac{(O - E)^2}{E}$$

where O is the obtained frequency and E is the expected frequency in
each category. For our example, we can tabulate these as follows:

PTA Study #1

                        Categories (Groups)
Frequency       Parents   Teachers   Others   Total

Observed (O)       60        45        15      120
Expected (E)       40        40        40      120






Remember, the O's are what was actually observed. The E's are what we'd
expect if there were no differences in group attendance. Does O = E,
i.e., does the null appear true? Application of the formula will be the
deciding factor.

$$\chi^2 = \frac{(60-40)^2}{40} + \frac{(45-40)^2}{40} + \frac{(15-40)^2}{40}
         = \frac{400 + 25 + 625}{40} = \frac{1050}{40} = 26.25$$



So, $\chi^2 = 26.25$; what do we do now? Just like Z or t, used for
analyses of means, $\chi^2$ has a probability distribution. The probability
distribution, like these other ones, indicates the likelihood of
obtaining different $\chi^2$ values were the null to be true (i.e., O = E).
So, again, we might take the .05 or .01 level as a basis for inferring that
our sample represents a population where $O \neq E$ and justifies rejection
of the null.



If you'll turn to Table III in the Appendix, you'll see such a
chi-square table. Like the t table, it lists only "selected" probability
values (.01, .05, .10) -- the ones conventionally used in hypothesis
testing. But, also like the t table, there's a "catch" in the form of $df$.
You need to know $df$ in order to obtain the critical value. For reasons
we won't go into in this introductory book, $df$ is determined in
one-way (one-variable) cases by the following simple expression: $k - 1$,
where $k$ = number of categories or groups. In our case, $k = 3$; so $df = 2$.

Now we're ready. Suppose we want to test the null at $\alpha = .05$. What
is the critical value we need to reach or exceed? For $df = 2$, it is 5.99.

What is the conclusion in our study? Reject the null. Results suggest
that the three categories (groups) are not equally represented.
Based on the frequency table, a reasonable inference is that fewer
"others" attend PTA meetings for this school than do parents and teachers.
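The whole equal-frequencies test for the PTA data fits in a few lines of Python. This is a sketch, not a replacement for the table lookup: the critical value 5.99 is simply copied from the text, and the function name is illustrative.

```python
def chi_square(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

observed = [60, 45, 15]  # parents, teachers, others
k = len(observed)        # number of categories

# Under the "equal frequencies" null, every category expects n / k people.
expected = [sum(observed) / k] * k

chi2 = chi_square(observed, expected)
df = k - 1
critical = 5.99  # tabled chi-square for df = 2, alpha = .05 (from the text)

reject_null = chi2 >= critical
print(chi2, df, reject_null)
```

Swapping in a different list of observed counts is all it takes to rerun the test for another meeting or another school.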

Another Example: Let's suppose that we repeat the study at some other
school in a different state. Results for 180 people attending the meeting
are as follows: 65 parents, 62 teachers, and 53 "others." Let's
set up our O and E frequency table:

PTA Study #2

                     Categories (Groups)
Frequency     Parents   Teachers   Others   Total

Observed         65        62        53      180
Expected         60        60        60      180






Note that 180/3 = 60 becomes the E value for all, since if the 180 were
evenly divided, 60 people would belong to each group. Also, $df$ will be 2
since $k - 1 = 2$. Let's test at $\alpha = .05$ again.

$$\chi^2 = \frac{(65-60)^2}{60} + \frac{(62-60)^2}{60} + \frac{(53-60)^2}{60}
         = \frac{25 + 4 + 49}{60} = \frac{78}{60} = 1.30$$

What is the decision?

Retain the null. The critical value, 5.99, was not reached. There
is no basis for rejecting the null hypothesis of O = E.

10.2.2 Testing for Goodness-of-Fit

Directly related to the above uses of chi-square (i.e., testing for
equal frequencies) is the goodness-of-fit test. The null hypothesis
statement (O = E) and computational procedures are identical to those
described in the previous section. The only difference is that we are
comparing our observed (O) frequencies to an expected (E) "pattern"
which can take any form, i.e., have unequal frequencies across categories.

For example, suppose that administrators at a certain university are
concerned with grade inflation. They distribute a memo to faculty
recommending that grades be awarded in roughly the following percentages.

        A     B     C     D     F   | Total

  %:   10    20    40    20    10   |  100

Professor Warmglow has the reputation for being an easy grader, so
his department chairman calls him in to present his grade distribution
from the previous semester. Warmglow had 130 students who received the
following grades:

        A     B     C     D     F

  f:   25    40    35    20    10

Warmglow does seem more generous than the university would like. Is he
a grade inflator? Let's test goodness-of-fit, using $\alpha = .05$. Hypotheses
are:

$H_0: O = E$

$H_a: O \neq E$

Now we'll construct our frequency table. In doing so, note that the E's
here will reflect the expected proportions of students in the different
grade categories. For example, based on what the university expects, 10%
of Warmglow's 130 students should have been A's: $E_A = .10 \times 130 = 13$;
20% should have been B's: $E_B = .20 \times 130 = 26$; etc. Our frequency table
thus becomes:






Grade Inflation Study

                      Categories (Grades)
Frequency      A      B      C      D      F     Total

Observed      25     40     35     20     10      130
Expected      13     26     52     26     13      130


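We compute $\chi^2$ in the usual way:

$$\chi^2 = \sum \frac{(O - E)^2}{E}
         = \frac{(25-13)^2}{13} + \frac{(40-26)^2}{26} + \frac{(35-52)^2}{52}
           + \frac{(20-26)^2}{26} + \frac{(10-13)^2}{13}$$

$$= \frac{144}{13} + \frac{196}{26} + \frac{289}{52} + \frac{36}{26} + \frac{9}{13}$$

$$= 11.08 + 7.54 + 5.56 + 1.38 + .69 = 26.25$$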



Here $df = 4$, since number of categories ($k$) = 5. Table III gives the
critical $\chi^2$ value as 9.49 for $\alpha = .05$, $df = 4$.

Decision: Reject the null. Warmglow's distribution does not provide a
very good fit with the University's.

Do you see that the goodness-of-fit test involved exactly the same
procedures as the "equal frequencies test," except the expected values
(E's) were based on some model? We applied $\chi^2$ to determine how well
the O's conformed to that model -- testing for equal frequencies
was not of concern (unless they were that way in the model!).
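The only new step in a goodness-of-fit test is turning the model's proportions into expected frequencies. A minimal Python sketch for Professor Warmglow's grades follows; the critical value 9.49 is copied from the text, and the function names are illustrative.

```python
def expected_from_model(proportions, n):
    """Expected frequency in each category: model proportion times sample size."""
    return [p * n for p in proportions]

def chi_square(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

model = [0.10, 0.20, 0.40, 0.20, 0.10]  # university's recommended A, B, C, D, F
observed = [25, 40, 35, 20, 10]         # Warmglow's 130 students

expected = expected_from_model(model, sum(observed))
chi2 = chi_square(observed, expected)
df = len(observed) - 1
critical = 9.49  # tabled chi-square for df = 4, alpha = .05 (from the text)

print([round(e) for e in expected], round(chi2, 2), df, chi2 >= critical)
```

Feeding in equal proportions (one-third each, say) would turn this back into the equal-frequencies test, which is why the two procedures are described as virtually identical.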

Very quickly, we'll test to see how Professor Warmglow's colleague,
Dr. Blackhart, did in his grading of 50 history students.

                      Categories
Frequency      A      B      C      D      F     Total

Observed       5      5     10     15     15       50
Expected       5     10     20     10      5       50



It is important before continuing that you understand how the E's
were derived. Remember, the university recommended 10% A's = .10 x 50 = 5;
20% B's = .20 x 50 = 10; etc.

$$\chi^2 = \frac{(5-5)^2}{5} + \frac{(5-10)^2}{10} + \frac{(10-20)^2}{20}
           + \frac{(15-10)^2}{10} + \frac{(15-5)^2}{5}$$

$$= 0 + 2.5 + 5 + 2.5 + 20 = 30$$






Since $df$ hasn't changed, the critical value remains at 9.49.

Decision: Reject the null. Blackhart's distribution doesn't provide a
very good fit with the University's either. Which distribution would
students hope your statistics teacher's is closest to, Warmglow's or
Blackhart's? If you answered Blackhart, you could well be one of the
people helping to make the distribution that way!

Final Note: Whenever $df = 1$ in the one-way test, it is recommended that
an adjustment, officially known as Yates' correction, be made in the
formula. When will $df = 1$? Whenever you have only two categories, since
$df = k - 1$. The correction involves subtracting the constant .5 from
the absolute value of each O-E in the numerator of the formula. Why?
Mathematical analyses have shown that when there are only two categories,
the conventional $\chi^2$ formula tends to overestimate the values that should
be obtained if the null (O = E) were true. Yates' formula corrects for
this. Here it is:

$$\chi^2 = \sum \frac{(|O - E| - .5)^2}{E}$$

The special vertical parentheses mean that we will ignore the sign
of the difference, and treat all O-E's as positive. Suppose Prof.
Blackhart's grades were simply reported as Pass or Fail. Since F is the
only failing grade, the new table would look like this.

Dr. Blackhart's Passes/Failures

                  Categories
Frequency     Pass    Fail    Total

Observed       35      15       50
Expected       45       5       50



$$\chi^2 = \frac{(|35-45| - .5)^2}{45} + \frac{(|15-5| - .5)^2}{5}
         = \frac{(10 - .5)^2}{45} + \frac{(10 - .5)^2}{5}
         = \frac{(9.5)^2}{45} + \frac{(9.5)^2}{5}$$

$$= 2.01 + 18.05 = 20.06$$

Looking at the table for $\chi^2$ ($\alpha = .05$, $df = 1$), the critical value is
3.84. Reject the null.
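Yates' correction changes only the numerator, as this short Python sketch of the Pass/Fail example shows. The uncorrected value is computed alongside purely for comparison; the function names are illustrative, and 3.84 is the tabled value quoted above.

```python
def chi_square(observed, expected):
    """Conventional formula: sum of (O - E)^2 / E."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def chi_square_yates(observed, expected):
    """Yates' correction: subtract .5 from each |O - E| before squaring."""
    return sum((abs(o - e) - 0.5) ** 2 / e for o, e in zip(observed, expected))

observed = [35, 15]  # Dr. Blackhart's passes and failures
expected = [45, 5]   # 90% pass, 10% fail under the university's model

uncorrected = chi_square(observed, expected)
corrected = chi_square_yates(observed, expected)
critical = 3.84      # tabled chi-square for df = 1, alpha = .05 (from the text)

print(round(uncorrected, 2), round(corrected, 2), corrected >= critical)
```

The corrected value is a little smaller than the uncorrected one, which is the whole point of the adjustment; in this example the decision to reject the null is unchanged.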






10.2 POSTTEST (answers in back)

1. A librarian is interested in determining the popularity of different
   types of books. She defines a number of categories and does a
   frequency count of the number of books taken out in each. Results
   for a total of 108 books are as follows: Novels = 40, Nonfiction = 25,
   Children's Books = 28, "Other Types" = 15.

   a. Given that she wants to test for equal frequencies, state the
      null hypothesis. Interpret what it means in terms of this
      situation.

   b. What are the observed and the expected frequencies?

   c. What is the computed $\chi^2$?

   d. What is the tabled value, for $\alpha = .01$?

   e. What is the decision?

   f. What type of error is possible?

2. Census data for a particular city indicate that its total population
consists of 40% Caucasians, 40% Blacks, 15% Chicanos, and 5% Other.
The city Zoo Director wants to determine whether the racial make-up
of those visiting the zoo conforms to that distribution. He monitors
the zoo attendance and finds the visitors to be 35% Caucasians, 41%
Blacks, 20% Chicanos, and 4% Other.



a. Fill in the following table: 

Zoo Attendance

Frequency    Caucasian    Black    Chicano    Other    Total

Observed
Expected

Note: We can use the percentages as if they were frequencies.
All that matters is that the O's and the E's are on the identical
score scale (i.e., frequencies or %'s, etc.)



b. Test the Zoo Director's hypothesis at $\alpha = .05$. What is the
computed $\chi^2$? What is the decision?



3. Suppose the Zoo Director in the above example didn't know the city
figures. Instead he wants to test whether the number of visitors
to the zoo varies on the basis of race. Test this hypothesis using
$\chi^2$. (Again, simply use the %'s as if they were frequencies. Your
E's will also have to be in that scale, but note that the E's for
this hypothesis will not be the same as those for #2.)

4. Identify the circumstances which dictate using Yates' correction in
the $\chi^2$ formula.




10.3 Instructional Unit: Using chi-square in a two-way test

In the last section we looked at uses of $\chi^2$ to test hypotheses
concerning one variable. In other instances, we may wish to test
hypotheses concerning two (or more) variables. The basic interest in the
latter case usually boils down to questions about the relationship or
independence of two variables. Are the two variables related to (or
dependent on) each other? Are they unrelated or independent of each
other? Examples might be: Are people's ethnic backgrounds related to the
way they vote? Is occupation related to where people live in a city?
Testing for "relationship" might bring to mind the correlational
techniques studied in Unit V. That's an entirely appropriate association;
the difference here, though, is that the data are nominal/categorical
measures (i.e., frequencies). Otherwise, the two applications (correlation
and $\chi^2$) can be used to answer quite similar research questions.

It seems best at this point to move quickly to examples, and illustrate
the two-way $\chi^2$ design through them. Here we go:

The Rubber Duck Manufacturing Co. has 3 general classes of employees:
management, worker (secretarial, assembly line, mechanical, etc.), and
custodial. In planning for shifts in personnel due to changes in the
economy, company officials want to know if a relationship exists between
job category and location of residence: city, suburbia, country. They
survey their 150 employees and get the following results:



                          Job Category
Residence
Location      Manager    Worker    Custodial    Total

City              7         60         12         79
Suburbia         13         20          3         36
Country           5         20         10         35

Total            25        100         25        150



Is residence location related to job category? $\chi^2$ to the rescue! Our
hypotheses:

$$H_0: O = E$$

$$H_A: O \neq E$$

Surprised? No need to be. We're again asking whether the observed
frequencies conform to what would be expected (if the null were true).
What is the null saying here? It represents a condition in which there
is no relationship between variables; a condition where the variables
are independent of one another.

Think hard about what is being said as you read this paragraph. It
is the "crux" of the logic behind the $\chi^2$ application. Give yourself an
A+ and a pat on the back (and maybe even a cookie!) if you can answer
the following question without reading further.





Question: If the null were true, what would be the expected frequencies
for the table? (How would one go about determining them?)

Answer: If the null were true, Residence and Job would be unrelated.
The same proportions of people living in the three different types of
locations would exist for management personnel, workers, and custodians.
For example, ignore the separate figures for the three job categories
and look only at the Totals in the last column on the right. What
percentage of employees, overall, live in the city? 79/150 = 52.67%.
If the null is true, then 52.67% of managers, 52.67% of workers, and
52.67% of custodians should live in the city. What, for example, is the
actual percentage of managers who live in the city? 7/25 = 28%.
Not much support for the null, is it? But we need a statistical test
to find this out more objectively and systematically.

Believe it or not, the $\chi^2$ formula we need is exactly the same as
the one used in the last section. But what still might present a bit
of uncertainty is the task of finding the E's. We need 9 of them because
our problem has 3 (job categories) x 3 (locations) = 9 cells. The formula
for identifying each E is shown below:

$$E = \frac{(\text{row total})(\text{column total})}{\text{overall total}}$$



Applying this to our results:

                   Manager                  Worker                    Custodian                Row Total

City       O:      7                        60                        12                       79
           E:      (79)(25)/150 = 13.17     (79)(100)/150 = 52.67     (79)(25)/150 = 13.17

Suburbia   O:      13                       20                        3                        36
           E:      (36)(25)/150 = 6         (36)(100)/150 = 24        (36)(25)/150 = 6

Country    O:      5                        20                        10                       35
           E:      (35)(25)/150 = 5.83      (35)(100)/150 = 23.33     (35)(25)/150 = 5.83

Column Total       25                       100                       25                       150
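The nine expected frequencies can be reproduced with a short Python sketch. The function name `expected_frequencies` is an illustrative choice, not something from the text; the logic is just (row total)(column total) / overall total applied to every cell.

```python
def expected_frequencies(observed):
    """Return the E for each cell: (row total)(column total) / overall total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


# Rows: City, Suburbia, Country. Columns: Manager, Worker, Custodial.
observed = [
    [7, 60, 12],
    [13, 20, 3],
    [5, 20, 10],
]

expected = expected_frequencies(observed)
for row in expected:
    print([round(e, 2) for e in row])
```

Each row of E's adds up to the same row total as the O's, which is a handy check on hand calculations.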



Note that E for managers in the city = 13.17. That was derived by
multiplying the Row Total for that cell (79) by the Column Total (25)
and dividing by the overall total (150). The same procedure was applied
to the remaining 8 cells. And now for the $\chi^2$ formula:






$$\chi^2 = \frac{(7-13.17)^2}{13.17} + \frac{(60-52.67)^2}{52.67} + \frac{(12-13.17)^2}{13.17} + \frac{(13-6)^2}{6} + \frac{(20-24)^2}{24}$$

$$+ \frac{(3-6)^2}{6} + \frac{(5-5.83)^2}{5.83} + \frac{(20-23.33)^2}{23.33} + \frac{(10-5.83)^2}{5.83}$$

$$= 2.89 + 1.02 + .10 + 8.17 + .67 + 1.5 + .12 + .48 + 2.98$$

$$= 17.93$$

Is that result significant, i.e., do we reject the null ($\alpha = .05$)?
What we need to know is $df$.

In the two-way $\chi^2$, $df$ is determined by the following expression:

$$df = (C - 1)(R - 1)$$

where C = number of columns and R = number of rows.

Thus $df = (3 - 1)(3 - 1) = 4$.

What is the critical value for $\alpha = .05$, $df = 4$?

It is 9.49 (make sure you can find this on Table III).

Decision: Reject the null. The two variables do not appear independent
since the computed value 17.93 > critical value 9.49. Looking at the
table, what is your interpretation of the relationship?

Here's ours: Managers tend to live in suburbia more than is usual,
workers more in the city, custodians more in the country.
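The whole two-way test can be sketched in Python as a check on the hand calculation. The function name `two_way_chi_square` is hypothetical, and the critical value 9.49 is copied from the text. One caution: the code carries the E's at full precision, so its result (about 17.92) differs slightly from the 17.93 obtained above with E's rounded to two decimals.

```python
def two_way_chi_square(observed):
    """Return (chi-square, df) for a table of observed frequencies."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    chi_sq = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / grand_total  # expected frequency
            chi_sq += (o - e) ** 2 / e

    df = (len(col_totals) - 1) * (len(row_totals) - 1)  # (C - 1)(R - 1)
    return chi_sq, df


# Rows: City, Suburbia, Country. Columns: Manager, Worker, Custodial.
observed = [
    [7, 60, 12],
    [13, 20, 3],
    [5, 20, 10],
]

chi_sq, df = two_way_chi_square(observed)
critical_value = 9.49  # tabled value for alpha = .05, df = 4 (from the text)

print(round(chi_sq, 2), df)
print("Reject the null" if chi_sq > critical_value else "Retain the null")
```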






Final Notes: In case you're developing any doubts about your ability
to remember "all" the different applications and formulas, here's
a summary chart to help you put everything in place.

I. One-Way Analysis (one variable)

   A. Test $H_0: O = E$ for either:

      1. Equal frequencies

         a. O = the frequencies observed

         b. E = the expected number if all categories were equal

      2. Goodness-of-fit

         a. O = the frequencies observed

         b. E = the expected frequencies if the proportions in each
            category conformed to some hypothesized (or
            known) distribution

   B. Formula: $\chi^2 = \sum \frac{(O - E)^2}{E}$

   C. $df = k - 1$

   D. Special Cases:

      1. If $df = 1$, use Yates' correction formula (see p. 305).

      2. O's and E's can be frequencies, proportions, or percentages, but
         they must both be the same type of score.

II. Two-Way Analysis (two variables)

   A. Test $H_0: O = E$ where

      1. O = the frequencies actually observed,

      2. E = the frequencies expected if the two variables are
         independent (unrelated):

         $E = \frac{(\text{row tot.})(\text{column tot.})}{\text{overall total}}$

   B. Formula: $\chi^2 = \sum \frac{(O - E)^2}{E}$

   C. $df = (C - 1)(R - 1)$

   D. Special Cases:

      1. Use regular formula even if $df = 1$.

      2. O's and E's must both be the same type of score.






10.3 POSTTEST (answers in back)

1. A researcher hypothesizes that the marital status of women is
related to their employment status. After surveying a random
sample of 200 women, the following results are collected:

                              Employment Status
                                  Employed     Employed
Marital Status    Unemployed      Part-time    Full-time    Total

Single                10              20           30          60
Married               60              50           30         140

Totals                70              70           60         200

Use the two-way $\chi^2$ analysis to test the independence of variables
at $\alpha = .01$. What is:

a. the computed $\chi^2$?

b. $df$?

c. the tabled $\chi^2$?

d. the decision?

2. The researcher (from #1) repeats the study using men. He finds:

                              Employment Status
                                  Employed     Employed
Marital Status    Unemployed      Part-time    Full-time    Total

Single                10              20           40          70
Married               15              40           75         130

Totals                25              60          115         200

What is:

a. the computed $\chi^2$?

b. the tabled $\chi^2$ for $\alpha = .05$?

c. the decision?



Unit X Review Test
(answers in back)

1. Differentiate between a one-way and a two-way $\chi^2$ analysis. Give an
example of each.






2. Differentiate between tests for equal frequencies and goodness-of-fit.
Where do they differ: formula, O's, E's? Explain.



3. The athletic director of a university wonders whether attendance
at basketball games is similar for students at different class levels.
He collects attendance data and finds the following:

Freshmen    Sophomores    Juniors    Seniors    Total

   20           30           25         25       100

Should he conclude that the classes differ?

a. What test is appropriate for that hypothesis, equal frequencies
or goodness-of-fit?

b. What is the computed $\chi^2$?

c. What is the tabled $\chi^2$, for $\alpha = .05$?

d. What is the decision?

4. The math instructor at the university wonders whether attendance at
the games conforms to the make-up of the student body in the following
majors: Arts and Sciences 15%, Business 30%, Education 20%, Others 35%.

The data he collects indicate the following attendance outcomes:

     Arts and Sciences    Business    Education    Others    Total
%           30               10           30         30       100

Does the attendance distribution conform to how majors are distributed
at the university?

a. What is the computed $\chi^2$?

b. What is the tabled $\chi^2$, for $\alpha = .01$?

c. What is the decision?

d. Interpret the meaning of that decision with regard to the
research question.

5. Use the following results to test the independence of 120 individuals'
preferences for political party and for newspapers in a
certain town.








                                             Undecided
Newspapers        Republican    Democrat     or other     Total

Press Minotaur        30           15           25          70
Daily Planet          10           30           10          50

Totals                40           45           35         120

a. What is the decision based on $\chi^2$ results? ($\alpha = .05$)

b. Interpret what it means with regard to the research
question.






UNIT XI

INTRODUCTION TO ANALYSIS OF VARIANCE

A. General Objectives



This unit is designed to provide a transition point from the present
book's introductory treatment of statistics to the more advanced
treatment likely to be encountered in subsequent courses. The selected area
for the transition is "analysis of variance" (ANOVA), a highly useful
tool in statistical analysis, but (depending on instructor preference
and student level) not always a topic that can be adequately covered in
a beginning course. Thus, with the goal being to provide a transition
rather than a thorough treatment, the present unit on ANOVA will be
relatively brief, placing emphasis on its rationale, exemplary uses, and
interpretation rather than on its mathematical properties.

B. Specific Objectives

11.1 Describe the rationale for analysis of variance (ANOVA) designs.

11.2 Describe the general procedures used in computing ANOVA.

11.3 Interpret the statistical and research implications of ANOVA
results.



11.1 Instructional Unit: Using analysis of variance in research

The present unit is intended as a bridge to help you step more easily
from the introductory topics covered thus far to the more advanced ones
likely to be encountered in later courses (you were planning to continue
your study of statistics, weren't you?). Intermediate courses (textbooks)
have the habit of dashing through introductory topics at breakneck speed
only to set up camp and travel at a more leisurely pace within a new
territory: analysis of variance (ANOVA) designs. Uncomfortable in these
new surroundings, students may frantically dust off their old introductory
books searching for a friendly road map to connect the new with the old.
But one of two things often happens. Either the road map isn't there
(ANOVA isn't covered), or the old map looks almost exactly the same as the
new one: some explanation, but mostly formulas and math applications
requiring too much "decoding" to provide the direction needed. The
following approach is intended to be different. Its focus will be on practical,
user-oriented concepts that should (a) be understandable even long after you
forget symbols and formulas (many years from now), (b) provide a helpful
transition to more advanced study, and (c) leave you capable of using/
interpreting simple ANOVA designs before initiating that progression.



Why ANOVA? Couldn't They Have Stopped With the t Test?

Picture the following situation. A health specialist is concerned with
testing the effectiveness of five exercise programs on weight loss. The
programs consist of: (1) jogging, (2) swimming, (3) weight lifting,
(4) push-ups, and (5) aerobics. So, remembering things that we did in
Unit IX, you might recommend obtaining a sample pool of, say, 100 eligible
men, and randomly assigning each subject to one of the treatments (20 in
each). We would then go ahead and compare each treatment mean to each of
the others, using the t test for independent samples. The number of unique
combinations of pairs would be 10 (Sample 1 vs. 2, 1 vs. 3, 1 vs. 4, 1 vs.
5, 2 vs. 3, etc.). Findings would indicate which sample means were
significantly higher than others. We could then make inferences about the
associated populations (i.e., which exercise program, if given to all
eligible people, would be best).

But suppose that in reality all programs are exactly the same in
effectiveness; each should result in an average weight loss of 10 lbs. The
researcher, of course, doesn't know this. If he did, there would be no
reason to do the experiment!

Big Question: If he runs his analyses, which is the only type
of error (I or II) that he could make?

Small (but important) Answer: Type I

Given that there are actually no differences, he'll be making a correct
decision if he retains the null. If he rejects it, Type I all the way!
Can you see a problem for the Type I error rate when multiple t tests are
used? Remember, this researcher ran 10 tests to compare all unique pairs
of means.

To get you primed for identifying the problem, try this question. What
is the Expected Value, $E(\bar{X})$, for the Swimming Treatment mean? Answer: 10
lbs. (see above). What, then, is $E(\bar{X})$ for the Aerobics Treatment? Answer:
10 lbs. In fact, all five $E(\bar{X})$'s are 10, since all treatments are
equally effective (and the parameter value was defined as 10).

Will the actual Treatment $\bar{X}$'s all be equal to 10? No!! Sampling
fluctuation and random error will inflate some $\bar{X}$'s and reduce others. Here's
the key point (at last!): Isn't it true, then, that the more treatments
(samples) you compare, the greater the chances of obtaining at least one
weird (extreme) $\bar{X}$ (even though the particular treatment is not different
in its effects from any other)? Thus, the more multiple t comparisons you
make, the greater the chances that at least one Type I error will be made.
Think about it one more time in terms of this rather extreme example: If
we tested 100 different random samples from the same population, isn't it
very likely that some will "appear" to represent different populations
(at $\alpha = .01$ even!)? What makes them different? Sampling fluctuation is
the culprit. Each t test holds the Type I error rate at .05 for that one
comparison. But what isn't held at .05 is the Type I error rate across all
comparisons (4,950 unique comparisons in the above example). That error
rate will be much larger than 5%.
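To put rough numbers on this argument, here is a small Python sketch. The pair counts follow directly from the text (10 pairs for 5 treatments, 4,950 for 100 samples). The error-rate formula, 1 - (1 - alpha)^m, is an outside assumption brought in for illustration: it treats the m comparisons as independent, which pairwise t tests sharing the same groups are not, so read the result as a ballpark figure only. The function names are made up.

```python
from math import comb


def unique_pairs(k):
    """Number of unique pairwise comparisons among k treatment means."""
    return comb(k, 2)


def approx_familywise_error(alpha, comparisons):
    """Approximate chance of at least one Type I error across all tests.

    Assumes the comparisons are independent (an illustrative simplification).
    """
    return 1 - (1 - alpha) ** comparisons


pairs_5 = unique_pairs(5)
pairs_100 = unique_pairs(100)
rate_5 = approx_familywise_error(0.05, pairs_5)

print(pairs_5, pairs_100)
print(round(rate_5, 3))
```

Under that simplifying assumption, ten tests at .05 each carry roughly a 40% chance of at least one false rejection, far above the 5% held for any single test.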

Now for the answer to the question we started out with. Why ANOVA?
The reason is that ANOVA allows us to conduct an "overall" test of
differences between treatments. ANOVA holds the chances of falsely
concluding that treatments differ from one another at $\alpha$ (e.g., .05, .01).
The null hypothesis statement for our example thus becomes:

$$H_0: \mu_1 = \mu_2 = \mu_3 = \mu_4 = \mu_5$$






If retained, the suggestion is that the population means represented by
the sample (treatment) means do not differ. If rejected, the suggestion
is that there is at least one difference between population means. What
is that difference? ANOVA doesn't say, since it is an "overall" test,
but follow-up analyses can be used to probe that issue (those are easy to
perform, but describing them would take us beyond the scope of this
textbook). How does ANOVA work? We'll turn to that question in the immediately
following section. First, try your hand at the following "thought" questions.

11.1 POSTTEST (answers in back)

1. What is the problem with comparing more than two treatments using a
t test for each unique pair?

2. What happens to the Type I error rate as additional treatments are
added to the design and compared to one another?

3. Four treatment groups are compared through ANOVA. State the null
hypothesis.

4. Suppose the null hypothesis (from #3) is retained. What is the
implication about the population means in question? Suppose it is
rejected. What is the implication then?






11.2 Instructional Unit: Understanding general procedures for
computing ANOVA

In previous units we used statistics called z, t, and $\chi^2$ in testing
hypotheses. The logic was to compute the given statistic from a
mathematical formula and then compare the result to a probability table
(Tables I-III in the Appendix). The table showed the probability of
obtaining different values of the statistic, given that the null
hypothesis was true.

The ANOVA procedure also involves computation of a statistic. In
its case, the statistic is called F and is derived from the following
ratio:

$$F = \frac{\text{Variability between groups}}{\text{Variability within groups}}$$

Don't give up; give this concept a chance! By "variability between
groups" we mean how much the treatments (or groups) differ from one
another on the criterion. We would determine this specifically by
computing the variance ($\sigma^2$) between the group means ($\bar{X}$'s). But for now,
think about it this way. If we were comparing the effectiveness of three
study procedures on test performance, we might obtain results like $\bar{X}_1 = 80$,
$\bar{X}_2 = 70$, and $\bar{X}_3 = 100$. Someone else might compare three different
procedures and get: $\bar{X}_1 = 80$, $\bar{X}_2 = 82$, and $\bar{X}_3 = 80$. It should be obvious that
variability between groups is larger in the first study than in the second.
Without knowing the actual numerical value of that variance (or the value
of the denominator of the F ratio) it seems likely that the first analysis
will produce a larger F value than the second. Just like for t, z, and $\chi^2$,
the larger the value of the statistic, the less probable it becomes, and
the stronger the basis for rejecting the null. In this case:

$$H_0: \mu_1 = \mu_2 = \mu_3$$

So, at this point, the null looks less secure in the first study than it
does in the second.

Turning to the denominator of the ratio, what does it mean? By
"variability within groups" we mean how much the scores within the
treatments themselves differ from each other. This should be a familiar concept
because it has come into play every time we have computed the variance or
standard deviation for a set of data. So, the F formula really involves
analyzing "differences between groups" relative to "differences between
people" within those groups.

Now, let's turn our thoughts to some of the beautiful logic that
underlies the F formula. Think about this for a moment: What factors would
account for people within treatments obtaining different scores? After
all, all receive the same treatment.

If you're thinking along the lines that some are brighter, some try
harder, those same descriptions (brighter, trying hard) might fit you as
well! People differ, so there's going to be variability in their scores.
This is variability over which we have no control. Statisticians call it
random error.



So, let's ask the question again, using slightly different words.
What factor accounts for the denominator of the F ratio? Answer:
Random error.

$$F = \frac{\text{Variability between groups}}{\text{Random error}}$$

Ready to take another step in our journey? Try this one. Assume
that the null is false, and identify the factors that would account for
groups (or treatments) having different means.

One very good answer would be that if the null is false, the groups
would differ because their respective populations differ. Another way of
saying it is that there is an honest-to-goodness treatment effect.
(The groups differ because they receive different treatments and the
treatments differ in effectiveness.) But is this the only factor that
would lead to group variability? No, there's one more. Can you identify
it?

The second factor is .... random error! Just like people can differ
within groups due to uncontrollable factors, groups are affected by the
same properties. (We called this sampling fluctuation in earlier
discussions.) So, when the null is false, the F ratio is determined by:

$$F = \frac{\text{Random error} + \text{Treatment effect}}{\text{Random error}}$$

Since there is no reason to assume that the two random error estimates
differ significantly, we can begin to make a prediction about the nature
of our computed F. What would it be in your opinion: (a) a number equal
to 1.00, (b) a number less than 1.00, or (c) a number greater than 1.00?
Looking at the expression, the expected value of F would have to be
greater than 1.00. So, the larger the treatment effect, the larger the
F value will tend to be. And, on F tables (one is not included in this
book), you might expect "statistically significant" F's to be ones with
values greater than 1.00. That is the case! (How much greater, though,
depends on other factors.)

Now let's assume that the null is true. What factor would account for
the denominator of the F ratio? Random error again; no change here.
What factor(s) would account for the numerator? Random error another
time! But what about that treatment effect we talked about earlier? It
doesn't appear this time because if the null is true, no treatment effect
exists. This leaves us with the following as an expectancy:

$$F = \frac{\text{Random error}}{\text{Random error}}$$

So, when the null is true, what is the expected F: (a) a value of
about 1.00, (b) a value less than 1.00, or (c) a value greater than 1.00?
Looking at the expression, "a" should be the obvious choice. When ANOVA
is run, F values close to 1.00 are suggestive of "no treatment effects"
and justify retention of the null.
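The "about 1.00 when the null is true, larger when it is false" expectation can be illustrated with a small simulation. Everything numeric here is an assumption made for the sketch: scores are drawn from a normal distribution with a mean of 10 lbs, a made-up standard deviation of 3, five groups of 20, 500 repetitions, and a made-up 3-lb advantage for one program in the "null false" case. The function names are hypothetical, and the F computation uses the between/within mean squares that the text introduces later in this unit.

```python
import random


def average(values):
    return sum(values) / len(values)


def f_ratio(groups):
    """F = (variability between groups) / (variability within groups)."""
    scores = [x for g in groups for x in g]
    grand_mean = average(scores)
    ss_between = 0.0
    ss_within = 0.0
    for g in groups:
        group_mean = average(g)
        ss_between += len(g) * (group_mean - grand_mean) ** 2
        ss_within += sum((x - group_mean) ** 2 for x in g)
    df_between = len(groups) - 1
    df_within = len(scores) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


def mean_simulated_f(treatment_effects, n_per_group=20, sd=3.0, runs=500, seed=1):
    """Average F over many simulated studies (all settings are illustrative)."""
    rng = random.Random(seed)
    f_values = []
    for _ in range(runs):
        groups = [
            [rng.gauss(10 + effect, sd) for _ in range(n_per_group)]
            for effect in treatment_effects
        ]
        f_values.append(f_ratio(groups))
    return average(f_values)


null_true = mean_simulated_f([0, 0, 0, 0, 0])   # no treatment effect anywhere
null_false = mean_simulated_f([0, 0, 0, 0, 3])  # one program really is better

print(round(null_true, 2), round(null_false, 2))
```

With no treatment effect the average F hovers near 1.00; adding a real effect to one group pushes the average well above it.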






In real life, of course, we don't know whether or not the null is 
true. All that we can do is conduct our study, collect our results, and 
analyze them through the F formula. The magnitude of the F value com- 
puted will then be used to infer whether or not a treatment effect is 
likely to be operating. 

Sums of Squares and Degrees of Freedom

The above non-mathematical conceptions of the F ratio are useful for
conveying the logic involved in the procedure. But what numerical values
actually represent the different types of effects? Here we must resort to
some mathematical relationships whose logic may not be so obvious to the
beginning student. But give them a try. You might surprise yourself.

Remember the opening example involving the five types of exercise
treatments? There were 20 subjects randomly assigned to participate in
each program. Let's say that after all have participated, we average the
scores of the total sample of 100, as if treatments didn't exist, to
obtain a grand (overall) mean. In this example we'll pretend that $\bar{X}_{tot}$ =
6.2 lbs. (weight loss). Now, continuing to ignore treatment membership,
we compute the sum of the squared deviations of each score from the
overall mean, just the way we would have in any Unit III problem:

$$\sum x^2 = \sum (X - \bar{X}_{tot})^2$$

(Subtract the total mean, 6.2, from each score, square result, and sum
all such results.)

or

$$\sum x^2 = \sum X^2 - \frac{(\sum X)^2}{n}$$

where n = 100.

In ANOVA, the obtained value has a name, sum of squares total,
symbolized $SS_{tot}$.

By a surprisingly simple mathematical proof (which we won't
demonstrate), it can be shown that $SS_{tot}$ can be partitioned into two components,
symbolized because of the way they are computed as $SS_{between}$ and $SS_{within}$.

$SS_{between}$ is derived by summing the squared deviations of group means
about $\bar{X}_{tot}$, the overall mean. The farther the group means are from one
another, the larger $SS_{between}$ will be.*

$$SS_{between} = \sum (\bar{X}_{group} - \bar{X}_{tot})^2$$

(Subtract the total mean, 6.2, from each group mean, square result, sum
all such results.)

*In actually computing $SS_{between}$, the squared deviation scores are
weighted by the number of subjects in each group: $\sum n(\bar{X}_{group} - \bar{X}_{tot})^2$.
(See page 328 for more detail.)






$SS_{within}$ is derived by adding the squared deviations of scores about their
respective group means. It is influenced by the extent to which
individuals in a group differ from one another (remember random error?).

$$SS_{within} = \sum (X - \bar{X}_{group})^2$$

If you can make sense out of these expressions, fine. If not, don't
worry. It's the procedure that's important in this treatment of ANOVA.
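For readers who find code easier to follow than summation signs, the three sums of squares can be sketched in Python. The scores below are invented purely for illustration (three small groups of three people each, not the 100-subject exercise study), and the variable names are arbitrary. The between-groups sum uses the weighted form mentioned in the footnote.

```python
def average(values):
    return sum(values) / len(values)


# Hypothetical weight-loss scores (lbs.), made up for this sketch.
groups = {
    "jogging": [8, 10, 12],
    "swimming": [5, 6, 7],
    "aerobics": [9, 11, 13],
}

all_scores = [x for scores in groups.values() for x in scores]
grand_mean = average(all_scores)

# Total: every score's squared deviation from the overall mean.
ss_total = sum((x - grand_mean) ** 2 for x in all_scores)

# Between: squared deviation of each group mean from the overall mean,
# weighted by the number of subjects in the group.
ss_between = sum(
    len(scores) * (average(scores) - grand_mean) ** 2
    for scores in groups.values()
)

# Within: each score's squared deviation from its own group mean.
ss_within = sum(
    (x - average(scores)) ** 2
    for scores in groups.values()
    for x in scores
)

print(ss_total, ss_between, ss_within)
```

For these made-up scores the between and within pieces add up exactly to the total, which is the partition the text describes.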

Whether or not you can translate the expressions, perhaps you perceived
their relationship to the components of the F ratio introduced earlier.
Your tendency might be to say, "I got it! You compute $SS_{between}$ and $SS_{within}$,
divide the former by the latter, and you get F!" Well, almost, but
not quite. Variances, you may recall (from Unit III), are averages.
Analysis of Variance (ANOVA), therefore, implies that just "summing along"
(as done to get the various SS's) is not sufficient.

Specifically, each SS must be converted into an average (mean) to get
the entries for the final ANOVA formula. What was the appropriate divisor
($df$) used in Unit III to get $s^2$, the sample variance? It was n-1. The
same principle holds here: Since $SS_{between}$ is derived from the squared
deviations of treatment means (let's call their number a) from the overall
mean, $df_{between} = a - 1$ (# of treatments minus one).

$SS_{within}$ is derived from the squared deviations of scores from their
treatment means. In each treatment, $df = n - 1$. If there are a treatments,
$df_{within} = a(n - 1)$.

$SS_{tot}$ is based on the squared deviations of scores from the overall
mean: $df_{tot} = an - 1$.

Here's a summary:

$$df_{between} = a - 1$$

$$df_{within} = a(n - 1)$$

$$df_{tot} = an - 1$$

In our example involving 5 treatments and 20 subjects in each:

$$df_{between} = 5 - 1 = 4$$

$$df_{within} = 5(20 - 1) = 95$$

$$df_{tot} = (5)(20) - 1 = 99$$






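Note that 95 + 4 = 99.

Remember we said that: $SS_{tot} = SS_{between} + SS_{within}$.
This relationship is also reflected in $df$'s.

In any case, now we are ready for the actual computation formula:

$$F = \frac{SS_{between} / df_{between}}{SS_{within} / df_{within}}$$

A simpler and more commonly used notation is to express the two
components as "Mean Squares" (MS).

$$MS_{between} = \frac{SS_{between}}{df_{between}} \qquad MS_{within} = \frac{SS_{within}}{df_{within}}$$

So, we compute F by:

$$F = \frac{MS_{between}}{MS_{within}}$$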


F Summary Tables 

When ANOVA is computed, the results are normally presented in a summary 
table. As a matter of fact, computerized data analysis programs provide 
these as standard output. They appear as follows: 

Source        SS      df      MS        F

Between       620      4     155      35.07*
Within        420     95       4.42
Total        1040     99

*Significant at .05 level

The numbers filled in are hypothetical results for our "exercise" study.
Note that when $SS_{between}$ and $SS_{within}$ were computed, results were 620 and
420, respectively. $SS_{tot}$ was 1040, which is the sum of the between
and within components. Since there were five groups, $df_{between} = 4$. By
using the formula, a(n-1), $df_{within} = 95$. Dividing each SS by $df$ gives
us the values in the MS column. F is the ratio of those MS values.
The asterisk next to the 35.07 tells the reader that the F value was
significant. (Remember a value close to 1.00 is expected if there is no
treatment effect.) This study reflected significant variation between the five
treatments. We can infer that a treatment difference occurred. Examination
of means (and follow-up tests) will later be needed to tell us exactly where
the differences lie.
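The bookkeeping behind a summary table like this one is simple enough to sketch in Python. The function name `anova_summary` and the dictionary layout are illustrative choices, not the output format of any particular statistics package. Note that the code divides by the unrounded $MS_{within}$ (about 4.421), so it yields an F of about 35.06 rather than the 35.07 obtained from the rounded 4.42.

```python
def anova_summary(ss_between, ss_within, a, n):
    """Build the summary-table entries for a treatments with n subjects each."""
    df_between = a - 1
    df_within = a * (n - 1)
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "Between": {"SS": ss_between, "df": df_between, "MS": ms_between},
        "Within": {"SS": ss_within, "df": df_within, "MS": ms_within},
        "Total": {"SS": ss_between + ss_within, "df": df_between + df_within},
        "F": ms_between / ms_within,
    }


# Hypothetical "exercise" study from the text: 5 programs, 20 subjects each.
table = anova_summary(620, 420, a=5, n=20)

for source in ("Between", "Within", "Total"):
    print(source, table[source])
print("F =", round(table["F"], 2))
```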






Shown below is a partially completed summary table for a study in
which three different teaching strategies were compared in terms of
student achievement on a final exam. The number of students assigned to each
treatment was 16. Can you fill in the missing values (indicated by
question marks in parentheses)?

Source        SS      df      MS       F

Between        42     (?)     (?)     (?)
Within        (?)     (?)     (?)
Total         138     (?)

A lot of question marks, no question about that! But all the information
you need to derive every single value is there. First, if we know
that $SS_{tot} = 138$ and that $SS_{between} = 42$, then $SS_{within} = 138 - 42 = 96$.
The $df$'s are a - 1 = 2 and a(n-1) = 3(16-1) = 45 for the between
and within components. Total $df$ = 2 + 45 = 47.

$$MS_{between} = \frac{42}{2} = 21 \qquad MS_{within} = \frac{96}{45} = 2.13$$

$$F = \frac{MS_{between}}{MS_{within}} = \frac{21}{2.13} = 9.86$$

You would need an F table to tell you whether that value is significant.
In case you're curious, it is significant ($\alpha = .05$). We would conclude
that there is a treatment effect.
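The same fill-in-the-blanks reasoning can be written as a short Python sketch. The function name `complete_table` is hypothetical. As before, the code keeps $MS_{within}$ unrounded (about 2.133), so its F comes out near 9.84 instead of the 9.86 found with the rounded 2.13; the difference is rounding only.

```python
def complete_table(ss_total, ss_between, a, n):
    """Recover the missing summary-table values from the two SS's given."""
    ss_within = ss_total - ss_between   # the SS's partition the total
    df_between = a - 1
    df_within = a * (n - 1)
    df_total = df_between + df_within   # same as a*n - 1
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    f_value = ms_between / ms_within
    return ss_within, df_between, df_within, df_total, ms_between, ms_within, f_value


# Teaching-strategy study from the text: 3 treatments, 16 students each.
(ss_within, df_between, df_within, df_total,
 ms_between, ms_within, f_value) = complete_table(138, 42, a=3, n=16)

print(ss_within, df_between, df_within, df_total)
print(ms_between, round(ms_within, 2), round(f_value, 2))
```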

Again, what we have tried to do in this section is give you the basic
logic and procedures involved in ANOVA (very little emphasis has been put
on explaining calculations). The goal is to enable you to interpret
ANOVA results that you might read in research reports, and to progress to
more advanced courses (should you so desire) having had some background
with the ANOVA procedures. Try these posttest questions to test your
understanding of the material in the section. Then progress to the last
section (11.3) which provides a brief review of how to interpret ANOVA
results.

11.2 POSTTEST (answers in back)

1. The underlying logic in ANOVA involves the following ratio:

$$F = \frac{\text{Variability between groups}}{\text{Variability within groups}}$$

In your own words, explain what the ratio means.






2. In reference to the expression shown in #1, indicate:

(a) What factor(s) account for the denominator (regardless of whether
the null were true or false)?

(b) What factor(s) would account for the numerator if the null were true?

(c) What factor(s) would account for the numerator if the null were false?

(d) What would the Expected Value of F be if the null were false:
(1) > 1.00, (2) < 1.00, or (3) about = 1.00? Explain.



3. What is the interpretation regarding treatments (or groups) if the 
null hypothesis is rejected? 



4. Suppose the four treatment groups of 10 subjects each are compared by
ANOVA.

a. What is aifj,^ ? 

5. Replace the question marks in the following ANOVA summary table with
the correct values.

Source        SS      df      MS      F

Between       200      4       ?      ?
Within          ?     40       ?
Total        1000      ?






11.3 Instructional Unit: Interpreting ANOVA Results

This section is designed to boost your confidence that you can look
at an ANOVA summary table and interpret the findings. While it is true
that the present chapter provides very little exposure to calculating F
from original data, it is also true that in practice, hand calculations
are rarely done given that there is any access to computers. The
computer data analysis package (e.g., SPSS) tells you how to enter the
data and what "control" information is needed for the run. The output
may consist of many statistical indices, but in the case of ANOVA, the
important information is the ANOVA summary table. If you can interpret
it, you are well on the way to understanding what occurred in that part
of the study.

To give you practice at doing this, some ANOVA summary tables from
a hypothetical study will be presented for your consideration.

Description of Study

High school students were randomly assigned to three different
learning treatments, 17 per treatment group. One treatment presented
statistical material using education-related examples. Another
presented the same material using sports examples. A third presented the
material using abstract examples (A+B=C, etc.). The groups were compared
on a number of different criteria. Below are some results:

Analysis #1

In one analysis, the groups were compared on their ability to solve
completely new problems involving higher-order applications of the
material (30-item test).

Results for the three groups were:

Group       Mean

Education   23.5
Sports      22.9
Abstract    15.3

ANOVA Summary

Source      SS        df     MS       F

Between     71.76     2      35.88    3.33*
Within      516.47    48     10.76
Total       588.23    50

*Significant at .05





(Can you figure out how the df were derived?)
What happened in this analysis?

Ans.: The significant F indicates a significant treatment effect.
The null hypothesis of "no treatment differences" is rejected. Looking
at the means, the impression we get is that the Education and Sports
groups performed about the same. The Abstract group, though, "bombed
out." (Although follow-up tests will be needed to confirm that, such a
description seems a reasonable account of the data.) Conclusion: The
Abstract presentation was a relatively poor teaching device.
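
For readers who want to check the df, MS, and F entries of a summary table like the one for Analysis #1, the following is a minimal sketch rather than a procedure given in the text. The function name `anova_summary` is made up for illustration, and the sketch assumes k treatment groups with the same number of subjects in each.

```python
def anova_summary(ss_between, ss_within, k_groups, n_per_group):
    """Rebuild the df, MS, and F columns of a one-way ANOVA summary table."""
    n_total = k_groups * n_per_group
    df_between = k_groups - 1        # number of groups minus 1
    df_within = n_total - k_groups   # (n - 1) added up over the groups
    df_total = n_total - 1           # total number of subjects minus 1
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "df_between": df_between,
        "df_within": df_within,
        "df_total": df_total,
        "ms_between": ms_between,
        "ms_within": ms_within,
        "F": ms_between / ms_within,
    }


# Analysis #1: three treatments, 17 students per treatment.
table = anova_summary(ss_between=71.76, ss_within=516.47,
                      k_groups=3, n_per_group=17)
print(table["df_between"], table["df_within"], table["df_total"])
print(round(table["ms_between"], 2), round(table["ms_within"], 2),
      round(table["F"], 2))
```

Each MS is simply its SS divided by its df, and F is the ratio of the two MS values, which is why the SS and df columns are enough to rebuild the rest of the table.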



Analysis #2

In another analysis, the three groups were compared on their
retention of formulas. Results were:

Group       Mean

Education   21.76
Sports      24.12
Abstract    22.35

ANOVA SUMMARY

Source      SS        df     MS       F

Between     5.10      2      2.55     .13 (not significant)
Within      936.47    48     19.51
Total       941.57    50

Conclusion? There were no significant differences between groups.
The null hypothesis should be retained. If you examine the means, you
can get a further idea of why statistical analyses are needed. Without
a statistical test, what's to prevent the researcher from concluding that
the Sports treatment (which produced the highest mean) was superior for
learning formulas? The statistical test (ANOVA) let us know that such
"superiority" could easily have been obtained by random error (chance).
The value of the computed F is not large enough to suggest a real
treatment effect.

Analysis #3

In the last analysis, the three groups were compared on attitudes
toward instruction. Scores were derived from a survey, where 8 was the
highest (most positive) score and 0 was the lowest.



Results were:

Group       Mean

Education   5.53
Sports      6.71
Abstract    4.47

ANOVA SUMMARY

Source      SS        df     MS       F

Between     42.51     2      21.26    7.39*
Within      138.00    48     2.88
Total       180.51    50

*Significant at .05






11.3 POSTTEST (answers in back)

Interpret the results of Analysis #3.


































Unit XI Review Test
(answers in back)

1. Describe the effect on the Type I error rate as additional comparisons
between pairs of means are conducted via t tests.

2. As differences between treatments increase, the size of the F value
in ANOVA tends to

a. increase b. decrease c. remain the same

3. As the number of treatments compared via ANOVA increases,

a. increases b. decreases c. remains the same

4. As the number of treatments compared via ANOVA increases, with $\alpha$
held constant, the chances of a Type I error

a. increase b. decrease c. remain the same

5. As additional subjects are added to treatments, with everything else
held constant, $df_{total}$

a. increases b. decreases c. remains the same

6. As additional subjects are added to treatments, with everything 
else held constant, ^iJ^^^ 

a. increases b. decreases c. remains the same

7. As treatment effects increase, with everything else held constant,
the amount of random error

a. increases b. decreases c. remains the same



8. Complete the following ANOVA Summary Table for a study involving
four treatments, with 18 subjects in each.

Source      SS        df     MS       F

Between     60        ?      ?        ?
Within      ?         ?      ?
Total       650       ?

9. If the F probability table shows the critical value (at $\alpha$ = .05)
needed to reject the null as 2.75, what is the decision in #8?

10. What is the meaning of that decision with regard to treatments?

(See pp. 328-329 for "optional" explanations on F computations
and critical values.)






SOS for SSS

In case you are suffering from Statistics Symbol Shock (SSS) and
have no idea what the different SS formulas mean (i.e., you have SSS
from the SS's), here's an explanation. (Keep in mind that there are
simpler-to-use computational formulas, but these lose visible association
with the SS definitions in the translation.)

$SS_{total}$

1. Subtract the total mean, $\bar{X}_T$ (regardless of groups), from each score.
2. Square each resultant deviation score.
3. Add them to get $\sum (X - \bar{X}_T)^2$.

$SS_{between}$

1. Subtract the overall mean, $\bar{X}_T$, from each group mean.
2. Square each resultant deviation score and multiply it by
n, the number of subjects in the group.
3. Sum the values from Step #2 and you have $SS_{between}$.

$SS_{within}$

1. Subtract the Group 1 mean, $\bar{X}_1$, from each score in Group 1.
2. Square each resultant deviation score.
3. Add the squared deviation scores to get $\sum (X - \bar{X}_1)^2$.
4. Repeat the process for all groups, ending with the last one.
5. Add the group SS's to get $SS_{within}$.
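
The three step lists above can be traced in a few lines of code. This is only a sketch: the function name `sums_of_squares` and the nine scores are invented for illustration and do not come from the text.

```python
def sums_of_squares(groups):
    """Return (SS_total, SS_between, SS_within) for a list of score lists."""
    all_scores = [x for group in groups for x in group]
    grand_mean = sum(all_scores) / len(all_scores)

    # SS_total: squared deviation of every score from the total mean.
    ss_total = sum((x - grand_mean) ** 2 for x in all_scores)

    ss_between = 0.0
    ss_within = 0.0
    for group in groups:
        group_mean = sum(group) / len(group)
        # SS_between: squared deviation of the group mean from the total
        # mean, multiplied by n, the number of subjects in the group.
        ss_between += len(group) * (group_mean - grand_mean) ** 2
        # SS_within: squared deviation of each score from its own group mean.
        ss_within += sum((x - group_mean) ** 2 for x in group)
    return ss_total, ss_between, ss_within


# Hypothetical scores for three small groups (invented for illustration).
groups = [[4, 5, 6], [7, 8, 9], [1, 2, 3]]
ss_total, ss_between, ss_within = sums_of_squares(groups)
print(ss_total, ss_between, ss_within)
```

With these made-up scores the between and within pieces add up to the total, which is the same bookkeeping that makes the SS column of a summary table sum to its Total row.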






F TABLE MANNERS

As previously mentioned, your book does not include an F probability
table. The reason is simple. No problems are included that necessitate
use of the table. Also, the table is quite long, containing much
more information than needed for t or $\chi^2$. Why? That reason is also
simple. F probabilities vary on the basis of two df values, $df_{between}$
and $df_{within}$. Thus, the F table must show critical ($\alpha$ = .05, .01)
values for combinations of the two df's. For example, if $df_{between}$ = 2,
there will be a different critical value for each possible value of
$df_{within}$ (an infinite number!). F tables, being of sound
mind, only report values for selected df's (like every 5th one up to
100), but the result is still a pretty long table.

Should you need to consult an F table, do it with enthusiasm and
excitement. Using one is easy! Columns (moving across the page) will
correspond to different $df_{between}$ values and rows (moving down the page and
onto other pages) will correspond to different $df_{within}$ values. Find
the ones that correspond to your df's (if not found, use the next
available lower ones reported). The section where the column and row cross
will report critical F values for commonly used $\alpha$'s (.05, .01, etc.).
There is nothing to it!
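
To see why an F table needs a separate entry for every pair of df values, the sketch below computes critical values for the one case mentioned above, 2 df between groups. It is not taken from the book. It relies on a closed-form shortcut that holds only when the between-groups df is exactly 2, and the function name `critical_f_df_between_2` is invented here.

```python
def critical_f_df_between_2(df_within, alpha=0.05):
    """Critical F for 2 df between groups and the given df within groups.

    With 2 numerator df, the upper-tail probability of F is
    (1 + 2F/df_within) ** (-df_within / 2). Setting that equal to alpha
    and solving for F gives the expression returned below.
    """
    return (df_within / 2) * (alpha ** (-2 / df_within) - 1)


# A different critical value for each df within groups.
for df_within in (10, 20, 48, 100):
    print(df_within,
          round(critical_f_df_between_2(df_within, 0.05), 2),
          round(critical_f_df_between_2(df_within, 0.01), 2))
```

For 2 and 48 df the .05 value comes out near 3.19, which squares with the three analyses in Section 11.3: F values of 3.33 and 7.39 exceed it, while .13 does not. For any other between-groups df, a printed table or a statistics package is still needed.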






FINAL, FINAL POSTTEST

Select one of the alternatives provided in response to the
following question:

QUESTION: Now that you have finished your introduction to
statistics (as provided in this book), your reaction is
one of

a. "At last it's over!" "A horrible experience that should
be avoided at all costs!!"

b. "It was bearable, but about as interesting as reading
the phonebook." "Never again, unless absolutely
necessary."

c. "Well, it's not as exciting as reading a mystery novel,
but the stuff was far more doable than had been expected."
"If I had the opportunity to take a more
advanced course, I might even consider it."

d. "It's been the event of my life; I am now happier,
look better, and have more friends as a result." "My
major will be changed immediately to statistical
theory or nuclear physics."

e. all of the above.

We hope that if your response was not "d," it was at least
"c." Statistics hardly makes for exciting, provocative
reading. But, it does include concepts that are becoming
increasingly prevalent in our very data-oriented society.
If you feel, as a result of this course, that those concepts
are learnable, usable, and need not be threatening to the
"non-mathematician" (which may include yourself), we have
accomplished our major objective. If your answer was "a" or
"b," we're sorry, but it's not because we didn't try. You
can still put your book to good use, as we hear it makes a
tolerable Frisbee or seat pad (for short people).




Answer Section 




UNIT A

Unit A Review Test

 1. 59x       14. 19
 2. -8x       15. 12
 3. -38x      16. 9
 4. 11z       17. 9
 5. -26x      18. 201.64
 6. 5         19. 1/9
 7. 88        20. 87Z
 8.           21. 29
 9. -04.0     22. 10
10. 2.5       23. 285
11. 31        24. 100
12. 26        25. 99
13. 28







UNIT I

1.1 POSTTEST

1. 10 in.: 9.5 in. - 10.5 in.
   5 in.: 4.5 in. - 5.5 in.

2. 1.372 sec: 1.3715 - 1.3725
   1.300 sec: 1.2995 - 1.3005

3. 10.1: 10.05 - 10.15
   11.0: 10.95 - 11.05
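
The answers above all follow one rule: the real limits lie half a unit of measurement below and above the reported value. The sketch below shows that rule under one stated assumption, namely that the unit of measurement can be read from the number of decimal places written. The function name `real_limits` is invented for illustration.

```python
from decimal import Decimal


def real_limits(measurement):
    """Return (lower, upper) real limits for a measurement given as text.

    The unit of measurement is taken from the number of decimal places
    written, so "1.300" is treated as measured to the nearest .001.
    """
    value = Decimal(measurement)
    unit = Decimal(1).scaleb(value.as_tuple().exponent)  # e.g. 0.001
    half_unit = unit / 2
    return value - half_unit, value + half_unit


for text in ("10", "1.372", "1.300", "10.1", "11.0"):
    lower, upper = real_limits(text)
    print(text, lower, upper)
```

Passing the measurement as text matters here: "1.300" and "1.3" are the same number but imply different units of measurement, and `Decimal` keeps the trailing zeros that make the difference.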



1.2.1 POSTTEST

X       f     cf

1.00    3     34
 .90    5     31
 .80    1     26
 .70    2     25
 .60    5     23
 .50    4     18
 .40    2     14
 .30    3     12
 .20    3     9
 .10    4     6
 .00    2     2

n = 34
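
The cf column is a running total of the f column, starting from the lowest score. A short sketch using the frequencies in the answer above (the variable names are illustrative only):

```python
from itertools import accumulate

# Frequencies from the 1.2.1 answer, listed from the lowest score (.00)
# up to the highest (1.00).
scores = [0.00, 0.10, 0.20, 0.30, 0.40, 0.50, 0.60, 0.70, 0.80, 0.90, 1.00]
frequencies = [2, 4, 3, 3, 2, 4, 5, 2, 1, 5, 3]

# Each cf is the running total of f, starting at the lowest score.
cumulative = list(accumulate(frequencies))

# Print from the highest score down, the way the table is laid out.
for x, f, cf in reversed(list(zip(scores, frequencies, cumulative))):
    print(f"{x:4.2f} {f:3d} {cf:3d}")
```

The top cf value must equal n, which gives a quick check that no frequency was miscounted.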






1.2.2 POSTTEST

X        f     cf

40-44    1     33
35-39    5     32
30-34    3     27
25-29    0     24
20-24    2     24
15-19    4     22
10-14    5     18
 5- 9    9     13
 0- 4    4     4

(this is only one of several
possibilities)






1.3.1 POSTTEST

a. [Graph not reproduced.]

1.3.2 POSTTEST

a. [Graph not reproduced.]

b. [Graph not reproduced; horizontal axis labeled X.]



1.4 POSTTEST

a. bimodal, positively skewed
b. bell-shaped, unimodal, symmetrical
c. unimodal, positively skewed
d. triangular, unimodal, symmetrical
e. bimodal, negatively skewed
f. rectangular, symmetrical
g. bimodal, symmetrical
h. unimodal, negatively skewed
i. rectangular, amodal, symmetrical
j. bimodal (intervals 29 - 31 and 20 - 22), positively skewed

Unit I Review Test

1. X     f     cf

   19    2     20
   18    4     18
   17    8     14
   16    4     6
   15    2     2

   n = 20

   It would be inappropriate to use a grouped frequency
   distribution since you are only dealing with five scores
   and grouped distributions are usually reserved for distributions
   where it is impractical to list every possible score.

2. 17.5 - 18.5

3. 19.5

4. The upper limit of each score would be used to define
the horizontal axis (i.e., 15.5, 16.5, 17.5, etc.).

5. Bell-shaped.



UNIT II

2.1 POSTTEST

a. A percentile rank is a number that represents the
percentage of scores that fall at or below some
specified raw score.

b. When we say that a student performed at $P_{89}$, we mean
that his raw score on the test or measure was equal
to or higher than 89% of the remaining raw scores.
In other words, this particular student scored higher
than (or the same as) 89%.




c. John did relatively better on the English test. Even
though his raw score, 96, was higher on the math test,
his percentile rank indicates that he only beat 79%
of his classmates. On the English test, however, his
raw score of 42 was higher than 96% of his classmates'
scores.

2.2 POSTTEST

Set 1

a) $\frac{49 - 48.5}{1} = \frac{cf_X - 21}{9}$
   $cf_X = 25.5$
   P.R. = 25.5/38 x 100 = 67.1
   answer: $P_{67.1}$

b) $\frac{47 - 46.5}{1} = \frac{cf_X - 10}{4}$
   $cf_X = 12$
   P.R. = 12/38 x 100 = 31.6
   answer: $P_{31.6}$

c) $\frac{44 - 43.5}{1} = \frac{cf_X - 2}{2}$
   $cf_X = 3$
   P.R. = 3/38 x 100 = 7.9
   answer: $P_{7.9}$

Set 2

a) $\frac{17 - 14.5}{3} = \frac{cf_X - 19}{4}$
   $cf_X = 22.33$
   P.R. = 22.33/25 x 100 = 89.3
   answer: $P_{89.3}$

b) $\frac{9 - 8.5}{3} = \frac{cf_X - 8}{3}$
   $cf_X = 8.5$
   P.R. = 8.5/25 x 100 = 34
   answer: $P_{34}$

c) $\frac{7 - 5.5}{3} = \frac{cf_X - 5}{3}$
   $cf_X = 6.5$
   P.R. = 6.5/25 x 100 = 26
   answer: $P_{26}$
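
Each answer above uses the same interpolation: find how far the score sits into its interval, take that fraction of the interval's frequency, add the cumulative frequency below the interval, and divide by N. A minimal sketch, with a function name and parameter names chosen here for illustration:

```python
def percentile_rank(score, lower_limit, width, cf_below, f, n):
    """Percentile rank of a score by interpolating within its interval.

    lower_limit: lower real limit of the interval containing the score
    width:       interval width
    cf_below:    cumulative frequency below the interval
    f:           frequency within the interval
    n:           total number of scores
    """
    cf_at_score = cf_below + f * (score - lower_limit) / width
    return 100 * cf_at_score / n


# Set 1 (a) and Set 2 (a) from the 2.2 answers.
print(round(percentile_rank(49, 48.5, 1, 21, 9, 38), 1))
print(round(percentile_rank(17, 14.5, 3, 19, 4, 25), 1))
```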















2.3 POSTTEST

Set 1

a. $P_{30}$:
   .30 = $cf_X$/N = $cf_X$/40
   $cf_X$ = 12
   $\frac{X - 41.5}{1} = \frac{12 - 7}{6}$
   X = 42.33
   answer: $P_{30}$ = 42.33

b. $P_{60}$:
   .60 = $cf_X$/40
   $cf_X$ = 24
   $\frac{X - 43.5}{1} = \frac{24 - 21}{10}$
   answer: $P_{60}$ = 43.8

c. $P_{90}$:
   .90 = $cf_X$/40
   $cf_X$ = 36
   $\frac{X - 45.5}{1} = \frac{36 - 35}{3}$
   answer: $P_{90}$ = 45.83

Set 2

a. $P_{60}$:
   .60 = $cf_X$/N = $cf_X$/35
   $cf_X$ = 21.0
   $\frac{X - 11.5}{3} = \frac{21 - 20}{6}$
   answer: $P_{60}$ = 12

b. $P_{90}$:
   .90 = $cf_X$/35
   $cf_X$ = 31.50
   $\frac{X - 17.5}{3} = \frac{31.5 - 31}{4}$
   answer: $P_{90}$ = 17.88
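
Going from a percentile to a score reverses the interpolation: first turn the percentile into a target cumulative frequency, then work out how far into the interval that frequency falls. The sketch below is illustrative only; the function and parameter names are not from the text.

```python
def score_at_percentile(p, n, lower_limit, width, cf_below, f):
    """Score at percentile p (as a proportion) by interpolation.

    lower_limit, width, cf_below, and f describe the interval that
    contains the target cumulative frequency p * n.
    """
    cf_target = p * n
    return lower_limit + width * (cf_target - cf_below) / f


# Set 1 (a) and Set 2 (b) from the 2.3 answers.
print(round(score_at_percentile(0.30, 40, 41.5, 1, 7, 6), 2))
print(score_at_percentile(0.90, 35, 17.5, 3, 31, 4))
```

Finding which interval contains the target cumulative frequency is still done by reading down the cf column, as in the worked answers.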






2.4 POSTTEST

Central tendency is one of several measures that indicates
the most typical value in a set of data.

2.4.1 POSTTEST

1. Mean is the point of balance in a distribution; the
deviations below it equal the deviations above it.
It is the numerical average of the scores.

2. SET 1: $\bar{X}$ = 15.7
   SET 2: $\bar{X}$ = 78.75
   SET 3: $\bar{X}$ = 7.91

2.4.2 POSTTEST

1. Median is the midway point in a distribution; half
of the scores fall above it, and half fall below it.

2. SET 1: Median = 9
SET 2: Median '%! 

SET 3: Median = 9.67
SET 4: Median = 25.0

2.4.3 POSTTEST

1. Mode is the most frequently occurring score value.

2. SET 1: Mode = 60 and 58 (distribution is bimodal)
   SET 2: Mode = 41



SET 3: Mode - 6 9 



2.5 POSTTEST

1. A. Median (seriously skewed distribution)
   B. Mean (slight skew)
   C. Median (top interval is open-ended)
   D. Median (serious skew)
   E. Mean (slight skew)
   F. Median (serious skew)

2. A. Mean #1, Md #2
   B. Mean #2, Md #1
   C. Mean and Median #2
   D. Mean = Median
   E. Mean less than Median
   F. Mean greater than Median
Unit II Review Test



1. 



a. 92. i: 

b. 84. 2> 

c. P70»6 

d. P8 

e. 87.3 « 

f. 88.25 

g. 92 s 

h. The median is the more meaningful measure
because this distribution has a serious negative
skew.


3. 



a. 


A 


a. 


A 


bv 
cA 


C 
A 




B 
A 




B 


' d. 


B 


e . 


C 






f . 


A 






g* 


B 










UNIT III

3.1 POSTTEST

Variability refers to the extent to which scores in a
distribution bunch up close to a measure of central
tendency (usually the mean) or spread out far from a
measure of central tendency.

3.2 POSTTEST

1. The range is the distance from the upper limit of
the highest score to the lower limit of the lowest
score. Or more simply: the range is the distance
from the bottom of the distribution to the top of
the distribution.

2. Set 1: range = 43
   Set 2: range = 11
   Set 3: range = 25

3.3 POSTTEST

1. The variance is the average of the squared deviations
around the mean in a set of data.

2. Set 1:

   $\sigma^2 = \frac{3540 - \frac{(166)^2}{8}}{8} = \frac{3540 - 3444.5}{8} = \frac{95.5}{8} = 11.94$

   Set 2:

   $\sigma^2 = \frac{\sum fX^2 - \frac{(\sum fX)^2}{N}}{N} = \frac{30196 - \frac{(830)^2}{23}}{23} = \frac{30196 - 29952.17}{23} = \frac{243.83}{23} = 10.6$

   *Set 3:

   $s^2 = \frac{\sum fX^2 - \frac{(\sum fX)^2}{n}}{n - 1} = \frac{960 - \frac{(90)^2}{12}}{11} = \frac{960 - 675}{11} = 25.91$

   *Note change in denominator as a result of using
   sample data.

3. The variances would remain unchanged.

4. Each new variance would be $3^2$ times as large as the
originals:

   Set 1: 107.46
   Set 2: 95.4
   Set 3: 233.19
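
The three variance answers use the same computational formula and differ only in the denominator. A minimal sketch that reproduces them from the sums shown in the answers (the function name `variance_from_sums` and the `sample` flag are chosen here for illustration):

```python
def variance_from_sums(sum_x2, sum_x, n, sample=False):
    """Variance from the sum of squared scores and the sum of scores.

    Uses N in the denominator for population data and n - 1 when
    sample=True.
    """
    sum_of_squares = sum_x2 - sum_x ** 2 / n
    denominator = n - 1 if sample else n
    return sum_of_squares / denominator


set1 = variance_from_sums(3540, 166, 8)
set2 = variance_from_sums(30196, 830, 23)
set3 = variance_from_sums(960, 90, 12, sample=True)
print(round(set1, 2), round(set2, 2), round(set3, 2))
```

Multiplying each of these by 9 gives the item 4 values to within rounding; the printed answers multiply the already rounded variances.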

3.4 POSTTEST

1. The standard deviation is the square root of the
variance; it is the square root of the average of
the squared deviations around the mean.

2. Set 1: Use n - 1 in denominator.

   $s = \sqrt{\frac{812 - \frac{(66)^2}{7}}{6}} = \sqrt{\frac{812 - 622.29}{6}} = \sqrt{31.62} = 5.62$

   Set 2:

   $\sigma = \sqrt{\frac{\sum fX^2 - \frac{(\sum fX)^2}{N}}{N}} = \sqrt{\frac{1379 - \frac{(179)^2}{24}}{24}} = \sqrt{1.83} = 1.35$

   Set 3:

   $\sigma = \sqrt{\frac{\sum fX^2 - \frac{(\sum fX)^2}{N}}{N}} = \sqrt{\frac{7460 - \frac{(422)^2}{26}}{26}} = \sqrt{\frac{7460 - 6849.38}{26}} = \sqrt{23.49} = 4.85$

3. The new standard deviations would be exactly the
same as the originals.

Unit III Review Test

 1. 419
 2. 26
 3. 16.12
 4. 6907
 5. 175,561
 6. 5.95
 7. 6.19
 8. 2.44
 9. 2.49
10. 11
11. 2.44
12. 1.22
13. 1.49
14. 14.94
15. 55.71
16. 6.19






UNIT IV

4.1 POSTTEST

1. A z score indicates the number of standard deviations a raw
score is above or below the mean.

2. A. X = .91, z = 3.67; .87 = 2.33; .8 = 0.00; .72 = -2.67

   B. X = 100, z = 1.14; 99 = 1.00; 90 = -.29; 82 = -1.43; 78 = -2.00

   C. X = 1.5, z = 1.50; -2 = -2.00; 0 = 0.00; -.4 = -.40
4.2 POSTTEST

1. For z = 2.2: $X = z\sigma + \bar{X}$ = (2.2)(15) + 100 = 33 + 100 = 133
       z = 1.0: X = (1)(15) + 100 = 115
       z = -2.0: X = (-2)(15) + 100 = -30 + 100 = 70
       z = 0.00: X = (0)(15) + 100 = 100

2. For z = 2.00: X = (2)(.3) + 1.2 = .6 + 1.2 = 1.8
       z = 0.30: X = (.3)(.3) + 1.2 = .09 + 1.2 = 1.29
       z = -.40: X = (-.4)(.3) + 1.2 = -.12 + 1.2 = 1.08
       z = -1.3: X = (-1.3)(.3) + 1.2 = -.39 + 1.2 = .81

3. For z = 3.00: X = (3)(4) + 50 = 62
       z = 2.0: X = (2)(4) + 50 = 8 + 50 = 58
       z = -2.0: X = (-2)(4) + 50 = -8 + 50 = 42
       z = -2.8: X = (-2.8)(4) + 50 = -11.2 + 50 = 38.8
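
The conversions in 4.1 and 4.2 are the same equation solved in two directions. A short sketch follows; the function names are invented here, and the mean of 92 and standard deviation of 7 used in the last line are inferred from the 4.1 answers for set B rather than stated in this section.

```python
def raw_to_z(x, mean, sd):
    """Number of standard deviations a raw score lies from the mean."""
    return (x - mean) / sd


def z_to_raw(z, mean, sd):
    """Raw score that corresponds to a z score: X = z * sd + mean."""
    return z * sd + mean


print(z_to_raw(2.2, mean=100, sd=15))
print(z_to_raw(-2.8, mean=50, sd=4))
print(round(raw_to_z(100, mean=92, sd=7), 2))  # assumed mean and sd
```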






4.3 POSTTEST

1. a. T = 10z + 50 = 10(2.00) + 50 = 20 + 50 = 70
   b. T = 10(0) + 50 = 50
   c. T = 10(-1.56) + 50 = -15.6 + 50 = 34.4 = 34
   d. T = 10(.72) + 50 = 7.2 + 50 = 57.2 = 57
   e. T = 10(-2.40) + 50 = -24 + 50 = 26

2. a. For X = 14: z = (14 - 22)/8 = -8/8 = -1.00; T = 10(-1) + 50 = 40
   b. For X = 22: z = 0; T = 50
   c. For X = 26: z = .5; T = 55
   d. For X = 16: z = -.75; T = 43

3. In order to find the raw score, you must first convert
T to z, and then convert z to X.

   a. For T = 50: z = (T - 50)/10 (this is the same equation as T = 10z + 50)
      z = (50 - 50)/10 = 0.00
      $X = z\sigma + \bar{X}$ = (0)(7) + 68 = 68

   b. For T = 40: z = (40 - 50)/10 = -1.00
      X = (-1)(7) + 68 = -7 + 68 = 61

   c. For T = 35: z = (35 - 50)/10 = -15/10 = -1.5
      X = (-1.5)(7) + 68 = -10.5 + 68 = 57.5

   d. For T = 62: z = (62 - 50)/10 = 12/10 = 1.20
      X = (1.2)(7) + 68 = 8.4 + 68 = 76.4

   e. For T = 47: z = (47 - 50)/10 = -3/10 = -.30
      X = (-.3)(7) + 68 = -2.1 + 68 = 65.9
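
The two-step route in item 3 (T to z, then z to X) can be written as a pair of small functions. This is a sketch with invented function names; the mean of 68 and standard deviation of 7 are the values that appear in the worked answers.

```python
def t_from_z(z):
    """T score for a z score: T = 10z + 50."""
    return 10 * z + 50


def z_from_t(t):
    """The same equation solved for z."""
    return (t - 50) / 10


def raw_from_t(t, mean, sd):
    """Convert T to z first, then z to a raw score."""
    return z_from_t(t) * sd + mean


print(t_from_z(-1.56))
for t in (50, 40, 35, 62, 47):
    print(t, raw_from_t(t, mean=68, sd=7))
```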



4.4 POSTTEST

1. The normal curve resembles a bell where the majority of
frequencies are in the middle area with the smaller number
of frequencies being at each end of the distribution.

2. a. true
   b. true
   c. false
   d. can't answer (curve refers to hamburgers, not cheeseburgers)

4.5 POSTTEST

1.

2. a. If we plot the distribution, the heights of the curve
at +1σ and -1σ will be exactly equal. This is because
the normal curve is symmetrical.

b. 1. 98

2. Neither, since they are both equidistant from
the mean.

3. Yes, it could be a possible score since you don't
know how large the σ around the mean is, and scores
usually spread out 3σ around the mean.



4.6 POSTTEST

[Normal curve with the horizontal axis marked -3σ, -2σ, -1σ, 0, 1σ, 2σ, 3σ; graph not reproduced.]

4.7 POSTTEST

1. 47.72%

2. 31.59 - 7.93 = 23.66%



4.8 POSTTEST

1. a. 66 and 71, z = -1.5 to -.25 = 43.32 - 9.87 = 33.45%

   b. 72 and 80, z = 0 to 2 = 47.72%

   c. below 68, z = -1 = 50.00 - 34.13 = 15.87%

2. a. 450 and 625, z = -.5 to 1.25 = 19.15 + 39.44 = 58.59%

3. c. 20.3 and 21.9, z = -.53 to -.06 = 20.19 - 2.39 = 17.80%
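
The areas above come from a printed normal-curve table. As a cross-check, and not as the book's method, the sketch below gets the same percentages from the `NormalDist` class in Python's standard `statistics` module; the helper names are invented here.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist(mu=0, sigma=1)


def percent_below(z):
    """Percentage of the normal curve falling below a z score."""
    return 100 * STANDARD_NORMAL.cdf(z)


def percent_between(z_low, z_high):
    """Percentage of the normal curve falling between two z scores."""
    return percent_below(z_high) - percent_below(z_low)


print(round(percent_between(-1.5, -0.25), 2))
print(round(percent_between(0, 2), 2))
print(round(percent_below(-1), 2))
print(round(percent_between(-0.5, 1.25), 2))
```

Small differences in the last decimal place (for example in the 450-to-625 answer) come from the table's rounding of each area to two decimals before adding.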




4.9 POSTTEST

1. a. For z = -2.4, P.R. = 0.82
   b. For z = -1.2, P.R. = 11.51
   c. For z = 0, P.R. = 50
   d. For z = .9, P.R. = 81.59
   e. For z = 2.3, P.R. = 98.93

2. a. For P.R. = 15, z = -1.04
   b. For P.R. = 25, z = -.67
   c. For P.R. = 40, z = -.25
   d. For P.R. = 60, z = .25
   e. For P.R. = 85, z = 1.04

3. a. For X = 14, P.R. = 90.82
   b. For X = 5, P.R. = 4.75
   c. For X = 9, P.R. = 37.07

4. a. For P.R. = 50, X = 10
   b. For P.R. = 80, X = 12.52
   c. For P.R. = 43, X = 9.46


Unit IV Review Test

1. a. 36.7%
   b. 98.9%
   c. 18.96%
   d. 14.98% of 1000 = 150 (approx.)
   e. 9.19%
   f. 26.43
   g. 3.01%
   h. 69.64
   i. 62.47%

2. a. 77.34
   b. 27.43
   c. 88.49
   d. 99.74
   e. 50.00
   f. 53.98



UNIT V

5.1 POSTTEST

1. Correlation is a method or technique used to determine the relation
between two variables.

2. a. number of digits: (a constant) cannot be used
   b. number of freckles: (a variable) can be used
   c. number of illnesses: (a variable) can be used
   d. number of toes: (a constant, assuming sloths are physically
      normal) can't be used
   e. number of friends: (a variable) can be used

3. a  1. height
   c  2. absences (the more absences, the lower the grade)
   a  3. pulse rate
   b  4. IQ (the higher the IQ, the higher the grade)
   c  5. TV watching (the more TV, the lower the grade)
   b  6. books read (the more books read, the higher the grade)

4. Answer is pair #3, numbers of bars and schools in different cities.

Pair #1, age of automobile and miles per gallon, should show negative
correlation, i.e., the higher the age, the lower the miles per gallon
(i.e., efficiency).

Pair #2, distance north and average daily temperature for cities,
should also correlate negatively, i.e., the farther north one is,
the colder it gets and the lower the temperature.

Pair #3 should correlate positively due to population of city as an
intervening variable. The higher the population, the more bars; the
higher the population, the more schools. Thus, cities with many bars
will tend to have many schools and vice versa.



5.2 POSTTEST

1. There are numerous possible procedures that could be used. If her
students were cooperative, she could have them fill out two
questionnaires, one designed to determine how many cigarettes they smoke,
and the other designed to assess their attitudes toward school
(perhaps a 10-point scale). Thus, her units of analysis would be
students. The variables to be examined would be number of cigarettes
smoked per day and attitude toward school (with a score of "1"
indicating a very poor attitude and a score of "10" indicating a
very favorable attitude). After measuring each student in her sample
on both of these variables, she would then tabulate the data by
listing each student's cigarette score and attitude score next
to his/her name. Finally, she may wish to construct a scatterplot
to obtain an immediate overall impression of the obtained
relation.

2. There does appear to be a relation between height and point
production for this particular sample. Generally speaking, the
taller players tend to score more points than do the shorter
players. That is, high scores on one variable tend to be
associated with high scores on the other, and low scores on one with
low scores on the other. Thus, the relation would be positive.

3. The units of analysis would be "dogs." Each dog included in the
sample would be measured with respect to aggression tendencies
and amount of punishment received. The latter measure might be
obtained, of course, through interviews with the dogs' masters
(as the dogs themselves may be reluctant to admit anything).

4. This one might be a little difficult to understand at first. The
appropriate units would have to be "days." One could randomly
select a sample of "days" from a particular calendar year. For
each day, we could check existing weather information to
determine its average temperature, and medical records to determine
the number of fainting spells which occurred. Our measures
would, of course, have to be restricted to a particular area
location (e.g., New York City, Pittsburgh, our home town, etc.).

Now, if the interest shifted to fainting by people on a particular
day, the units of concern would be people. That is, people could
provide a measure of fainting (0 = no; 1 = yes; 2 = more than one
time) and a measure of body temperature (98.8, 104, etc.). Do
you see the difference between this research question and the one
above? The first asks what daily temperature does for daily
fainting; the second asks what people's temperature does for
people's fainting.



5.3 POSTTEST

1. a. [Scatterplot not reproduced; vertical axis: Salary.]

b. There appears to be a moderately strong positive relation (the
points slope upward, but cannot all be fitted on a straight
line).

c. If the new subject's scores were included in the analysis, the
relation would be weakened. This is because his scores run
contrary to the overall tendency for high salaries to be
associated with additional years of education. If these scores
were to be represented by a point on the scatterplot, the
result would be a reduction in slope and more deviation from a
straight line pattern.

2. Plot A: strongest relation because more points could be fitted on
a straight line.

Plot A: very strong positive relation
Plot B: moderately strong negative relation
Plot C: weak to moderately strong positive relation
Plot D: very strong negative relation
Plot E: no relation



5.4 POSTTEST

1. The correlation coefficient is a numerical value that clearly
specifies the strength and direction of the relation between
variables. It ranges from +1.00, indicating a perfect positive
relation, to -1.00, indicating a perfect negative relation. The
middle value is 0.00, indicating no relation. The sign of the
coefficient indicates the direction of the relation, whereas the
absolute value of the numerical component indicates the strength
^ - ' \ ' . * 

#3 



of the relation, 



a. 
b. 

cl 
d. 
e . 



Plot 
Plot 
Plot #1 



.^lot 
Plot 



#5 



3. a. grades and dating: imperfect negative
      grades and absences: also imperfect negative

b. absences are the better predictor since their relation to
grades is stronger

c. A person with many dates (high score) should have relatively
low grades (low score), as suggested by the direction of the
obtained correlation. This would not necessarily apply,
however, to all students because the relation is imperfect,
suggesting that there are exceptions to the overall tendency.
Thus, we can make an educated guess but not be totally certain
about John.

d. Plot B would probably represent the dating-grades relation since
it represents a weaker relation than that depicted in Plot A.
Plot A, of course, would be more likely to describe the
absences-grades relation.

4. a. years of education and income: imperfect positive
      distance from city and income: imperfect negative

b. distance would be the better predictor since its relation to
income is stronger (even though the direction of the relation
is negative).

c. People who have many years of education would tend to have high
incomes. People who live far from large cities (high scores on
distance variable) would tend to be low in income. These would
not necessarily apply to all people: the relations are not
particularly strong and thus there are likely to be many exceptions.

d. The distance-income relation would more closely approximate a
straight line because it is stronger. Strength has nothing to
do with direction of sign in a coefficient or slope in a
scatterplot.



5.5 POSTTEST

1. a. When variables are positively related, positive z scores on the
X variable will tend to be associated with positive z scores
on Y. The result will be positive cross-multiplication products,
$z_X z_Y$. Conversely, negative $z_X$ scores will tend to be associated
with negative $z_Y$ scores, also producing positive cross-products.

b. When variables are inversely related, positive $z_X$ scores will tend
to be associated with negative $z_Y$ scores, and vice versa. Result
will be mostly negative cross-products, a negative sum of
cross-products, and finally, a negative r.

c. When variables are unrelated, some positive $z_X$ scores will be
associated with positive $z_Y$ scores, resulting in positive
cross-products. Others, however, will be associated with negative $z_Y$
scores, producing negative cross-products. The positive and
negative cross-products will tend to cancel each other out,
producing a low sum and a low r.



2. Student 

A • 

B 

C 
D 

. E 

G 

^ ^ = 7^ = 









-1 .50 


-1 


.20 


-1 .20 


-1 


.50 


- .80 




.40 


- -.40' 


+. 


.80 


.00 




.00 


+ .40. 




.80 


+ ..80 


+ 


.40 


+1.20 . 


+1 


.50 


+1.50 




.20 



+ .80 



+1.80 
+1.80 
+ .32 

- .32 
.00 

- .32 
+ .32 

. +1 .lap^^- 

+1.80 
= 7.20 






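Student    X     Y     X^2    Y^2    XY

J          3     5     9      25     15
K          1     2     1      4      2
L          2     5     4      25     10
M          4     3     16     9      12
N          1     1     1      1      1

Sum        11    16    31     64     40

$r = \frac{N\sum XY - (\sum X)(\sum Y)}{\sqrt{[N\sum X^2 - (\sum X)^2][N\sum Y^2 - (\sum Y)^2]}} = \frac{(5)(40) - (11)(16)}{\sqrt{[(5)(31) - 121][(5)(64) - 256]}} = \frac{24}{\sqrt{(34)(64)}} = \frac{24}{46.6} = .52$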
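
The raw-score formula for r used in this answer can be checked with a few lines of code. The sketch below is illustrative: the function name `pearson_r` is invented, and the five score pairs are the ones consistent with the column sums in the worked answer (11, 16, 31, 64, and 40).

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson r from the raw-score (computational) formula."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator


x = [3, 1, 2, 4, 1]
y = [5, 2, 5, 3, 1]
r = pearson_r(x, y)
print(round(r, 4))
```

Carried to full precision the result is a little under .515; the keyed answer of .52 reflects rounding the denominator to 46.6 before dividing.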



Contestant   Congeniality   Muscle Tone   D     D^2

A            1              6             -5    25
B            3              4             -1    1
C            6              5             1     1
D            4              3             1     1
E            7              2             5     25
F            2              7             -5    25
G            5              1             4     16

$\sum D^2 = 94$

$r_s = 1 - \frac{6\sum D^2}{N(N^2 - 1)} = 1 - \frac{564}{7(49 - 1)} = 1 - \frac{564}{336} = 1 - 1.679 = -.68$

Results imply that congeniality and muscle tone ranks are
negatively correlated. The contestants with smaller muscles
aren't as friendly (maybe they don't have to be!)
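
The rank-difference formula in this answer is easy to verify by machine. The following sketch assumes there are no tied ranks, which is the case the formula is meant for. The function name is invented, and contestant E's muscle tone rank of 2 is inferred from the listed differences.

```python
def spearman_from_ranks(ranks_a, ranks_b):
    """Spearman rank correlation from two lists of untied ranks."""
    n = len(ranks_a)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))


# Ranks for contestants A through G.
congeniality = [1, 3, 6, 4, 7, 2, 5]
muscle_tone = [6, 4, 5, 3, 2, 7, 1]
r_s = spearman_from_ranks(congeniality, muscle_tone)
print(round(r_s, 2))
```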






Unit V Revi ew Test 
"A. 



c 


1. 


e 


6. 


e 


2. 


i d 


7. 


b 


3. 


c 


8. 


d 


4. , 


b 


9. 


d 


5. 


a 


lb. 






UNIT VI

6.1 POSTTEST

1. Probability refers to the relative frequency with which an event
is expected to occur.

2. 3/5 or 60%

3. 10/60 or 1/6 or .167

6.2 POSTTEST

1. a. mutually exclusive
   b. nonmutually exclusive
   c. nonmutually exclusive
   d. mutually exclusive
   e. mutually exclusive



J 



h -li + M 2 ^ _80 

210 210 " 210 ° 210 



or 38Z 



_8 . JO 
65 65 



8 10 
65 = 65 " 



3 2 5 
^- 15 "^15 =15 " 



• 4 



P(both) is subtracted because the individuals involved in one set
of events are also involved in the other set. If you don't subtract
P(both), they get counted twice and probability is distorted.
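
A small worked case shows why P(both) has to come off. The sketch below uses the queen-or-diamond example from an ordinary 52-card deck; the variable names are illustrative, and exact fractions are used so no rounding creeps in.

```python
from fractions import Fraction

# One card drawn from a standard 52-card deck.
p_queen = Fraction(4, 52)     # four queens
p_diamond = Fraction(13, 52)  # thirteen diamonds
p_both = Fraction(1, 52)      # the queen of diamonds sits in both counts

# Nonmutually exclusive events: subtract the overlap once.
p_queen_or_diamond = p_queen + p_diamond - p_both
print(p_queen_or_diamond)

# Mutually exclusive events (queen or king): nothing to subtract.
p_king = Fraction(4, 52)
p_queen_or_king = p_queen + p_king
print(p_queen_or_king)
```

Without the subtraction the first answer would be 17/52, which counts the queen of diamonds twice.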



6.3 POSTTEST

1. a. independent
   b. dependent
   c. dependent
   d. independent

2. a. 18/20 x 17/19 = 306/380 or 81% (dependent trials!)
   b. 18/20 x 18/20 = 324/400 or 81% (independent trials!)
   c. 4/52 x 4/51 = 16/2652 = 4/663
   d. 13/52 x 12/51 = 156/2652 = 13/221 = 1/17
   e. 8/52 x 8/51 = 64/2652 = 16/663
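
The difference between dependent and independent trials shows up in the second fraction of each product. A brief sketch using the numbers from answers 2a, 2b, and 2d (the variable names are chosen here for illustration):

```python
from fractions import Fraction

# Dependent trials: the first selection is not replaced, so both the
# numerator and the denominator shrink by one on the second trial.
p_dependent = Fraction(18, 20) * Fraction(17, 19)

# Independent trials: the second trial has the same odds as the first.
p_independent = Fraction(18, 20) * Fraction(18, 20)

print(p_dependent, float(p_dependent))
print(p_independent, float(p_independent))

# Answer 2d: 13 of 52 cards on the first draw, 12 of 51 on the second.
p_2d = Fraction(13, 52) * Fraction(12, 51)
print(p_2d)
```

Both of the first two products round to 81%, but they are not the same probability; the exact fractions make the gap visible.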



Unit VI Review Test

1. a. 27/32 or 84%
   b. 5/32 or 16%
   c. 32/32 or 100%

2. a. 15/127 or 12%
   b. 20/127 or 16%
   c. 35/127 or 28%
   d. 15/127 x 14/126 = 210/16002 = 105/8001 (dependent trials!)
   e. 15/127 x 20/126 = 300/16002 = 150/8001



3. a.

Mutually exclusive events refer to outcomes that cannot occur
together, e.g., drawing one card and getting a queen and a king.
Nonmutually exclusive events are outcomes that can occur together,
e.g., drawing one card and getting a queen and a diamond.

Independent trials refer to the case where outcomes on one trial
have no influence on the outcomes in another trial, e.g., drawing
a queen from one deck (Trial 1) and drawing a queen from another
deck (Trial 2).

Dependent trials refer to the case where outcomes on one trial
influence what occurs on another trial, e.g., drawing a queen from
a deck (Trial 1), keeping it, and trying to draw another queen from
the remaining cards (Trial 2).
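
As a rough illustration of the two-queens example, the sketch below multiplies the trial probabilities under each condition. The function name and its flag are hypothetical; only the deck arithmetic comes from the definitions above.

```python
from fractions import Fraction

def two_queens(second_deck_is_fresh):
    """Probability of a queen on Trial 1 and a queen on Trial 2."""
    first = Fraction(4, 52)
    if second_deck_is_fresh:
        # Independent trials: Trial 1 tells us nothing about Trial 2.
        second = Fraction(4, 52)
    else:
        # Dependent trials: one queen is gone, and so is one card.
        second = Fraction(3, 51)
    return first * second

independent = two_queens(second_deck_is_fresh=True)
dependent = two_queens(second_deck_is_fresh=False)

print(independent)  # 1/169
print(dependent)    # 1/221
```

Keeping the first queen lowers the chance of the second one, which is exactly what makes the trials dependent.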



4. a. 5/316 or 1.6%

b. 5/316 × 4/315 = 20/99540 = 1/4977

c. 311/316 or 98.42%

d. 311/316 × 310/315 = 96410/99540, or about 97%

e. 5/316 × 5/316 = 25/99856








UNIT VII 



7.1 POSTTEST




1. A random sample is representative of the population from which
it is selected. Random sampling involves ensuring that every
element (person or thing) in the population of interest has an
equal chance of being selected.

2. Statisticians use samples to estimate population parameters
since whole populations usually cannot be tested. A random
sample helps ensure that the estimate (statistic computed from
the sample) will be based on representative data. A random
sample is likely to provide a close estimate of the population
parameters; a biased (non-random) sample will provide, of course,
a biased estimate. Also, statistical procedures developed for
hypothesis testing require that the samples employed are random.

3. The most straightforward procedure would be to write the names
of all 300 Ss (subjects) on separate pieces of paper. Place all
the names in a container, mix them up, and without looking into
the container, select the 50 names one-by-one. The result will
be a random sample of 50 individuals because each name had an
equal chance of being selected. A more efficient procedure
would be to use a random numbers table of the type found in
almost every statistics textbook.
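
A computer can stand in for the container of paper slips. The sketch below is one possible version, assuming Python's standard `random` module as the "random numbers table"; the subject labels and the seed are made up for the illustration.

```python
import random

# Hypothetical labels for the 300 Ss; any list of names would do.
subjects = [f"S{i:03d}" for i in range(1, 301)]

# The seed is fixed only so that the sketch is repeatable.
rng = random.Random(7)

# Draw 50 names without replacement; every name has an equal chance.
sample = rng.sample(subjects, k=50)

print(len(sample), sample[:5])
```

Because `sample` draws without replacement, no subject can be picked twice, just as a slip pulled from the container cannot be pulled again.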

4. This one is a little tricky. The resultant sample will be
random, but not for the population of male citizens who reside
in Catatonia, Nevada. It is true that random selection procedures
were employed (i.e., the use of a random numbers table).
But, did every male citizen meeting the requirements have an
equal chance of being selected? The answer is "no," because
the phonebook was used to define the population. What about
individuals with unlisted numbers? What about individuals who
don't have phones? What about individuals who recently moved
to the area and are not listed, as yet, in the phonebook?
Hopefully, all would agree that the resultant sample of 50
is random with respect to the population of males whose names
are listed in the Catatonia phone directory; but the sample is
biased with respect to the total population of males over 18
who reside in Catatonia.






POSTTEST 

1. Statistics are indices that are derived from samples;
Parameters are indices derived from populations.
(Samples yield statistics and populations yield parameters.)
Statistics are used as estimates of parameters.
Greek letters refer to populations and Roman letters refer
to samples.

2. Sampling fluctuation occurs where there is variation
between representative samples for a given population.
Samples allow for probabilistic estimations because
only a portion of the entire population is dealt with
and, therefore, is subject to chance variation.

3. Generally speaking, a Sampling Distribution represents
the probabilities of obtaining values for a given statistic
computed for each of an infinite number of samples
that are the same size and that all represent the same
population. The three major factors determining its properties
are the type of statistic, the size of the sample,
and the nature of the population.

POSTTEST

1. a. Unbiased

b. Unbiased

c. Biased, underestimate

d. Unbiased

e. Unbiased

f. Biased, underestimate

2. The larger the size of the sample, the smaller the amount
of bias, and vice versa. Why? Large samples are closer to
representing the population, and populations yield true
values.

POSTTEST

1. If the population from which sampling takes place is normally
distributed, the Sampling Distribution of means will also be
normally distributed. If the population scores are not normally
distributed, the form of the Sampling Distribution of means
will approach normality as sample size increases.

The farther the population is from "normality," the higher
the n needed to ensure normality in the Sampling Distribution;
e.g., a seriously skewed population necessitates a larger
sample n to ensure normality than does a moderately skewed
population.
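
The claim about skewed populations can be tried out with a simulation. This is only a sketch under stated assumptions: the skewed population is an exponential distribution chosen for convenience, the "infinite number of samples" is replaced by 4000, and skewness is measured with a simple helper written for the purpose.

```python
import random
import statistics

def skewness(values):
    """Average cubed z score; 0 for a perfectly symmetric distribution."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)

def sampling_distribution(n, samples, rng):
    """Means of many random samples of size n from a skewed population."""
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(samples)
    ]

rng = random.Random(0)  # seeded only for repeatability
skew_small_n = skewness(sampling_distribution(2, 4000, rng))
skew_large_n = skewness(sampling_distribution(30, 4000, rng))

print(round(skew_small_n, 2), round(skew_large_n, 2))
```

The means based on n = 30 should come out far less skewed than those based on n = 2, which is the "approach to normality" described above.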

a. 2

b. 3

c. 5

d. 4

e. 6

f. 7

$E(\bar{X}) = \mu_X$. That is, if $\bar{X}$'s are computed for an
infinite number of random samples, the mean of all the $\bar{X}$'s
will be equal to the mean ($\mu_X$) score of the population
from which the samples are selected. Sample size (n) does
not influence this relation. The means of all samples of
the same size (regardless of their absolute size) would
still average out to be the population value.

$\sigma$ refers to the standard deviation of all raw scores in
the population; $\sigma_{\bar{X}}$ refers to the standard deviation of
all scores (actually sample means, $\bar{X}$'s) in the Sampling
Distribution; it expresses the variability in the Sampling
Distribution. As sample size increases, the Standard
Error of the Sampling Distribution decreases.

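a. n = 4, $\sigma = 20$, $\mu_X = 70$

Expected Value = 70 [$E(\bar{X}) = \mu_X$]

$\sigma_{\bar{X}} = \sigma/\sqrt{n} = 20/\sqrt{4} = 10$

b. n = 16, $\sigma = 20$, $\mu_X = 70$

Expected Value = 70 [$E(\bar{X}) = \mu_X$]

$\sigma_{\bar{X}} = \sigma/\sqrt{n} = 20/\sqrt{16} = 5$

c. n = 64, $\sigma = 20$, $\mu_X = 70$

Expected Value = 70 [$E(\bar{X}) = \mu_X$]

$\sigma_{\bar{X}} = \sigma/\sqrt{n} = 20/\sqrt{64} = 2.5$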
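
To see how quickly the Standard Error shrinks, here is a minimal sketch of the $\sigma/\sqrt{n}$ rule with $\sigma = 20$ and sample sizes of 4, 16, and 64. The function name is an illustrative choice.

```python
import math

def standard_error(sigma, n):
    """Standard deviation of the Sampling Distribution of means."""
    return sigma / math.sqrt(n)

for n in (4, 16, 64):
    print(n, standard_error(20, n))
```

Each fourfold increase in n cuts the Standard Error in half, because n sits under a square root. The Expected Value stays at the population mean throughout.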

POSTTEST

1. a. greater than 26: In this Sampling Distribution, $E(\bar{X}) = 25$,
$\sigma_{\bar{X}} = 1$. Thus, $z = \frac{26 - 25}{1} = +1.00$.

Using the table, the percentage of scores greater than z =
+1.00 is (50.00 - 34.13) = 15.87%

b. between 24 and 26: For 24, $z = \frac{24 - 25}{1} = -1.00$

For 26, z = +1.00 (see above)

Using the table, the percentage of scores between +1z
and -1z is (34.13 + 34.13) = 68.26%

c. between 23.5 and 24: For 23.5, $z = \frac{23.5 - 25}{1} = -1.50$

For 24, z = -1.00 (see above)

Using the table, the percentage of scores between -1.5z
and -1z is (43.32 - 34.13) = 9.19%

2. If 4 cases are used, $E(\bar{X})$ will still = 25, but $\sigma_{\bar{X}}$ is now = 2

a. between 26 and 28: For 26, $z = \frac{26 - 25}{2} = +.50$

For 28, $z = \frac{28 - 25}{2} = +1.50$

Using the table, the % of scores between +.5z and +1.5z =
(43.32 - 19.15) = 24.17%

b. less than 23: For 23, $z = \frac{23 - 25}{2} = -1.00$

On table, % less than -1z = (50.00 - 34.13) = 15.87%

c. between 24 and 26: For 24, z = -.50
For 26, z = +.50
On table, % between +.5z and -.5z = (19.15 + 19.15) = 38.30%
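
The table lookups above can be double-checked by computing the normal curve areas directly. This sketch assumes the standard relationship between the normal curve and the error function in Python's `math` module; the two helper names are invented for the illustration.

```python
import math

def area_mean_to_z(z):
    """Proportion of the curve between the mean and z (what Table I lists)."""
    return 0.5 * math.erf(abs(z) / math.sqrt(2))

def percent_between(z_low, z_high):
    """Percentage of scores falling between two z scores."""
    def below(z):
        return 0.5 * (1 + math.erf(z / math.sqrt(2)))
    return 100 * (below(z_high) - below(z_low))

print(round(area_mean_to_z(1.00), 4))          # 0.3413
print(round(percent_between(-1.0, 1.0), 2))    # 68.27
print(round(percent_between(0.5, 1.5), 2))     # 24.17
```

The direct calculation gives 68.27% where the answer key shows 68.26%. The small gap comes from adding two table entries that were each rounded to four places.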



Unit VII Review Test

1. a. A random sample would ensure that every member of the population
has an equal chance of being selected, therefore increasing the
chances of accurately representing the whole population involved.

One procedure would be to write the scores of all 1000 dogs down
on separate slips of paper. Place them in a container, mix them
up thoroughly, and then pick out the slips one-by-one until the
desired sample quantity (or size) has been obtained.

The larger your sample, the more reliable your estimate of
population parameters.



2. a. Unbiased

b. Unbiased

c. Biased

d. Biased

e. Unbiased

f. Unbiased

g. Biased






3. A Sampling Distribution represents the probabilities of
obtaining values for a given statistic that is selected from an
infinite number of samples that are the same size and that all
represent the same population.

4. a. The statistic: Which statistic are we interested in? The mean
will have one type of Sampling Distribution, and the variance,
for example, will have another.

b. Sample size: The larger your sample size, the more accurate
your estimation.

c. Population: Sampling Distributions are clearly influenced by
the population from which the samples are randomly selected.
They will differ on the basis of population form, central
tendency, and variability.

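5. a. Its form would be a normal distribution because the population
is normally distributed.

b. $E(\bar{X}) = \mu_X$

$E(\bar{X}) = 8.5$

c. $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{2.3}{\sqrt{16}} = \frac{2.3}{4} = 0.575 = .58$

6. The Standard Error ($\sigma_{\bar{X}}$) would change:

$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} = \frac{2.3}{\sqrt{25}} = 0.46$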



7. a. greater than 9.0: In this Sampling Distribution, $E(\bar{X}) = 8.5$,
$\sigma_{\bar{X}} = 1.15$. Thus, $z = \frac{9.0 - 8.5}{1.15} = +.43$.
Using the table, the % of scores greater
than z = +.43 is (50.00 - 16.64) = 33.36%

b. less than 7.0: $z = \frac{7.0 - 8.5}{1.15} = -1.30$. Using the table,
the % of scores less than z = -1.30 is
(50.00 - 40.32) = 9.68%.

c. between 8.3 and 8.7: The standard error has changed as a
result of using n = 9. Compute it first:

$\sigma_{\bar{X}} = \frac{2.3}{3} = .77$

For 8.3, $z = \frac{8.3 - 8.5}{.77} = -.26$

For 8.7, z = +.26

Using the table, we get: 10.26 + 10.26 = 20.52%

d. between 8.5 and 9.5: For 8.5, z = 0

For 9.5, z = 1.30

Using the table, we get: 40.32%

e. Super Bonus: To determine the score above which 5% of the
sample means fall, you need to:

1. Realize that you are actually looking for the score
associated with P95 (95% below it, 5% above it).

2. Realize that P95 is associated with a z score that has
45% (50% - 5%) of the scores between it and the mean. From
the table, this z would be +1.64.

3. To find the raw score value, use your z score conversion,
but remember that you are dealing with sample $\bar{X}$'s and need
to use $E(\bar{X})$ and $\sigma_{\bar{X}}$.

$1.64 = \frac{X - 8.5}{.77}$

1.26 = X - 8.5
9.76 = X
Answer: 9.76
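
The Super Bonus conversion runs the z formula backwards. A minimal sketch, using the values from the answer above (z = 1.64, Expected Value 8.5, Standard Error .77) and a helper name chosen for the illustration:

```python
def score_from_z(z, expected_value, standard_error):
    """Turn a z score back into a raw score: X = E(X) + z * standard error."""
    return expected_value + z * standard_error

cutoff = score_from_z(1.64, 8.5, 0.77)
print(round(cutoff, 2))  # 9.76
```

The same helper works for any percentile once the matching z has been read from Table I.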






UNIT VIII 



8.1 POSTTEST



Since hypothesis testing generally involves the use of statistics
(sample data) to estimate parameters (population data), the conclusions
reached must be expressed in probability terms. Since parameters
usually cannot be computed directly, they must be estimated on the basis
of sample results. Samples fluctuate, and therefore, the researcher
cannot be certain that the samples he employed actually provide close
estimates. For this reason, we are restricted to expressing
conclusions as probable, rather than certain.



8.2 POSTTEST



1. One possibility:



uj a2 



Another possibility: 



Ma • u| < U2 



2. Set 1: Incorrect. Must be specified in terms of "no
differences" ($\mu_1 = \mu_2$).

Set 2: Correct

Set 3: Correct

Set 4: Incorrect. Alternatives must be exhaustive; no allowance
has been made for the possibility that $\mu_1$
might be less than the comparison value.

Set 5: Incorrect. Alternatives must be exhaustive; no allowance
has been made for the possibility that $\mu_1$
might be greater than 23.

Alternatives must also be nonoverlapping; if $\mu_1$
turns out to equal 22, more than one
alternative would be valid.

Set 6: Incorrect. Alternatives must be exhaustive; no allowance
has been made for the possibility that $\mu_1$
does not equal either $\mu_2$ or the value, 100.



8.3 POSTTEST




1. A Type I error occurs when one rejects a null hypothesis that is true.
A Type II error occurs when one retains a null hypothesis that is
false.






2. The decision to reject or retain the null hypothesis is directly
related to the probability level that is employed. The use of the
.05 level, for example, may result in the decision to reject the
null, whereas the opposite conclusion may be justified were the .01
level employed in the same study. Thus, to eliminate bias, the
probability level should be specified before the sample data is
collected. That way, a researcher cannot be criticized for using
a probability level specifically designed to yield an outcome that
he favors.

3. The larger the probability level, the greater the chance of committing
a Type I error.

The larger the probability level, the smaller the chance of committing
a Type II error.

4. a. The chance of a Type I error is 5%.

b. NA; impossible to compute the probability for a Type II error
without further information.

c. He may be committing a Type I error by his rejection of the null.

d. Type I: 10%

Type II: chances would decrease; exact probability is "NA."

8.4 POSTTEST

1. a. $H_0: \mu_X = 500$

b. $\sigma_{\bar{X}} = 100/\sqrt{100} = 10$

c. $z = \frac{482 - 500}{10} = -1.80$

d. Retain the null: critical value for the .05 level is -1.96z
and +1.96z. The computed z, therefore,
does not fall within the region of rejection.

e. A Type II error: failure to reject a false null hypothesis.

f. $z = \frac{520 - 500}{10} = 2.00$

Reject the null

g. Type I



2. a. $H_0: \mu_X = 25$

b. $\sigma_{\bar{X}} = 5/\sqrt{100} = 5/10 = .5$

c. $z = \frac{23.5 - 25}{.5} = \frac{-1.5}{.5} = -3.00$

d. Reject the null; computed z exceeds the critical value, -1.96z

e. Type I

f. Reject the null; computed z value exceeds the critical value,
2.58z

g. If a sample of 25 batteries were used, the Standard Error of
the Sampling Distribution would increase:

$\sigma_{\bar{X}} = 5/\sqrt{25} = 5/5 = 1$

$z = \frac{23.5 - 25}{1} = -1.50$

Retain the null

h. Type II

i. BONUS: By reducing sample size, the chances of making a Type I
error will remain unchanged. Type I errors are totally
dependent upon the level of significance.

The chances of making a Type II error, however, will
be increased since the Standard Error of the Sampling
Distribution will increase. This will require a greater
difference between the sample mean and the parameter
value to justify rejection of the null. Thus, there will
be a greater probability of failing to reject a null
hypothesis that is actually false (i.e., making a Type
II error).
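
The z test decisions in this posttest all follow one recipe, sketched below. The function name and its return format are assumptions made for the illustration; the numbers are the ones that appear in the answers above.

```python
import math

def z_test(sample_mean, mu, sigma, n, critical_z=1.96):
    """Two-tailed z test of a sample mean against a hypothesized mu."""
    standard_error = sigma / math.sqrt(n)
    z = (sample_mean - mu) / standard_error
    decision = "reject" if abs(z) > critical_z else "retain"
    return z, decision

print(z_test(482, 500, 100, 100))                  # z = -1.8, retain
print(z_test(520, 500, 100, 100))                  # z = 2.0, reject
print(z_test(23.5, 25, 5, 100, critical_z=2.58))   # z = -3.0, reject
print(z_test(23.5, 25, 5, 25))                     # z = -1.5, retain
```

The last two calls use the same sample mean. Only n changes, which is the point of the BONUS answer: a smaller sample means a larger Standard Error and a harder time rejecting the null.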



8.5 POSTTEST



1 



1. 



2. 



a. 
b. 

a. 

b. 



2.82 
2.07 



c. 4.54 

d. 3.46 



2.23 



as n increases, the critical t value decreases.

as alpha increases (say, from .01 to .05 to .10, etc.),
the critical t value decreases.

t must be used; population data are unavailable.

chances for a Type I error are 5%; chances for a Type II
cannot be determined from the available data.





At the .05 level with df = 8, the critical value is 2.31.



-2.5 



f . 



t = -2.50; Decision would be to reject.



Using .01, critical t = 3.36; Decision would be to retain.
Type II

Unit VIII Review Test

1. No, he had to rely on a sample to make inferences about the population.
The sample could be composed of unusual people and give a misleading
result. The findings are only suggestive, not conclusive.

2. Type I since the null was rejected.

3. Answer is "d" since his sample mean was higher than the parameter
value.

4. Answer is "b."

5. Answer is "d."

6. The null is assumed to be true until evidence is obtained which
makes its truth seem improbable.

7. No! The .01 level has even stricter criteria (larger critical
values) for rejecting the null.

8. a. $H_0: \mu_X = 80$



1?. +1.96% 
86'- 8 



d. Reject the null 

e. Type I . ^ , 

9. a. We don't know the population variance ($\sigma$)

^ 68 - '62 601^ 

c. for df = 24, t = 2.797

d. Retain 






UNIT IX 

9.1 POSTTEST 

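1. a.

$$t = \frac{91 - 79}{\sqrt{\frac{29(100) + 29(95)}{30 + 30 - 2}\left(\frac{1}{30} + \frac{1}{30}\right)}} = \frac{12}{\sqrt{\frac{2900 + 2755}{58}\left(\frac{2}{30}\right)}}$$

$$= \frac{12}{\sqrt{\frac{5655}{58} \cdot \frac{2}{30}}} = \frac{12}{\sqrt{\frac{11310}{1740}}} = \frac{12}{\sqrt{6.5}} = \frac{12}{2.55}$$

Ans. = 4.71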
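
For readers who want to check the arithmetic, here is a sketch of the independent-samples t computation with a pooled variance. It assumes, as the worked answer suggests, that 100 and 95 are the two sample variances and that each group has 30 scores; the function name is illustrative.

```python
import math

def pooled_t(mean1, mean2, var1, var2, n1, n2):
    """t for two independent samples using a pooled variance estimate."""
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error

t = pooled_t(91, 79, 100, 95, 30, 30)
print(round(t, 2))  # 4.71
```

The degrees of freedom for looking up the critical value are n1 + n2 - 2, or 58 here.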



b. For df = 58, t = 2.70 at $\alpha = .01$

c. Reject the null

d. Type I

2. Sex: it is the only variable likely to show a significant relationship
to facial hair.

3. Race: it would show the highest correlation with attitudes in this
study.

4.
Comments   No Comments     D     D²
   80          75          5     25
   98          90          8     64
   75          77         -2      4
   83          77          6     36
  100          88         12    144
   63          65         -2      4
   79          60         19    361
   94          93          1      1
   90          81          9     81
   70          66          4     16

$\Sigma D = 60$, $\Sigma D^2 = 736$

$\Sigma D^2 - \frac{(\Sigma D)^2}{n} = 736 - \frac{(60)^2}{10} = 736 - 360 = 376$

a. 2.94

b. 2.26 (df = 9)

c. Reject null. Comments are more effective than no comments.

d. Type I
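
The dependent-samples t of 2.94 can be reproduced from the ten score pairs. The sketch below uses one common computational form (mean difference divided by the standard error of the differences); the book's own intermediate steps are not fully legible, so treat the layout of the calculation as an assumption.

```python
import math

comments = [80, 98, 75, 83, 100, 63, 79, 94, 90, 70]
no_comments = [75, 90, 77, 77, 88, 65, 60, 93, 81, 66]

diffs = [a - b for a, b in zip(comments, no_comments)]
n = len(diffs)
sum_d = sum(diffs)                      # 60
sum_d2 = sum(d * d for d in diffs)      # 736

# Sum of squares of the differences: 736 - 60**2 / 10 = 376
ss_d = sum_d2 - sum_d ** 2 / n
standard_error = math.sqrt(ss_d / (n * (n - 1)))
t = (sum_d / n) / standard_error

print(sum_d, sum_d2, ss_d, round(t, 2))  # 60 736 376.0 2.94
```

With df = n - 1 = 9, the computed t exceeds the tabled 2.26, which matches the decision to reject the null.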




9.2 POSTTEST

1. a. $H_0: \rho_{xy} = 0$

b. df = n - 2
= 30 - 2
= 28

$t_{.01} = 2.76$

c. $t = \frac{.36}{\sqrt{(1 - .36^2)/28}} = \frac{.36}{\sqrt{.87/28}} = \frac{.36}{.176} = 2.04$

d. Retain the null

e. Type II

2. a. $H_0: \rho_{xy} = 0$

$H_a: \rho_{xy} \neq 0$

b. $t_{.05}$ for df of 14 = 2.145

c. $t = \frac{.78}{\sqrt{(1 - .78^2)/14}} = \frac{.78}{\sqrt{.392/14}} = \frac{.78}{.167} = 4.67$

d. Reject the null

e. Type I
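
The t test for a correlation coefficient is short enough to sketch in a few lines. The function name is made up; the formula is the one used in the two answers above, with df = n - 2.

```python
import math

def t_for_r(r, n):
    """t statistic (and df) for testing whether a correlation differs from 0."""
    df = n - 2
    t = r / math.sqrt((1 - r ** 2) / df)
    return t, df

t1, df1 = t_for_r(0.36, 30)
t2, df2 = t_for_r(0.78, 16)
print(round(t1, 2), df1)  # 2.04 28
print(round(t2, 2), df2)  # 4.66 14
```

The second result prints as 4.66 rather than the 4.67 in the answer key because the key rounds the denominator to .167 before dividing. The decision is the same either way.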






9.3 POSTTEST

1. Point estimation uses a single score value, such as one sample
mean, as the sole basis for estimating the population parameter
(i.e., mean). Interval estimation also uses a value obtained
from a single sample as the basis for its estimate of the
population. But, it further employs knowledge of the Sampling
Distribution (standard error and probability) to set an interval
around the estimate and to specify a degree of confidence (based
on probability) that the interval will contain the parameter.
(Try to use your own words. Memorizing this word for word may
produce permanent brain damage and no idea, before the damage sets
in, of what it means.)

2. "d": all are true.

3. a. df = n - 1 = 35

$t_{.95}$ = 2.04 (approx.)

Standard error = $12/\sqrt{36}$ = 2

Conf.95 = 68 ± 2.04 (2)

= 63.92 (lower), 72.08 (upper)

b. Using $t_{.99}$ (the .01 level):

Conf.99 = 68 ± 2.75 (2)

= 62.5 (lower), 73.5 (upper)

c. Point estimate = 68
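
The interval estimates above follow the pattern "point estimate plus or minus critical t times standard error." A minimal sketch, with a hypothetical function name and the critical t values taken from the answers (2.04 and 2.75):

```python
import math

def confidence_interval(mean, s, n, critical_t):
    """Lower and upper limits around a sample mean."""
    standard_error = s / math.sqrt(n)
    margin = critical_t * standard_error
    return mean - margin, mean + margin

low95, high95 = confidence_interval(68, 12, 36, 2.04)
low99, high99 = confidence_interval(68, 12, 36, 2.75)
print(round(low95, 2), round(high95, 2))  # 63.92 72.08
print(round(low99, 2), round(high99, 2))  # 62.5 73.5
```

Asking for more confidence (99% instead of 95%) widens the interval. The point estimate of 68 sits in the middle of both.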

Unit IX Review Test

A.  1. a     6. d
    2. d     7. b
    3. b     8. c
    4. c     9. d
    5. a    10. f

B. Conf.99 = 525 ± 2.80 (22.4)
= 462.28 and 587.72

Conf.95 = 525 ± 2.06 (22.4)
= 478.85 and 571.14








UNIT X 



10.1 POSTTEST

1. Interval-ratio measures are scores that can be ordered on a
continuum from low to high, and have equal distances between points.
Test scores, height, weight, etc. are examples.

Ordinal scores can be thought of as rankings. They, too, can be
ordered on a continuum from low to high. But one cannot infer that
there are equal distances between ranks on the trait being measured.
For example, in a beauty contest are all contestants likely to be
the same amount "more beautiful" than the person finishing immediately
behind them? Probably not: the 3rd and 4th place finishers might
have been very close; the 5th place finisher way behind them, etc.

Nominal measures simply identify a subject as a member or nonmember
in a particular category. The scores are f's (frequency counts) for
the different categories.

Chi-square is applied to nominal data. An example research question
is "Is hair color for school children equally distributed among
blonds, brunettes, and redheads?" "Are attitudes toward capital
punishment equally divided among supporters, nonsupporters, and
undecideds?"



10.2 POSTTEST 



1. a. $H_0$: O = E. It means that the different types of books are
selected with equal frequency.

b.            Novels   Nonfiction   Children   Other   Total

Observed:       40         25          28        15     108

Expected:       27         27          27        27     108

c. $\chi^2 = \frac{(40-27)^2 + (25-27)^2 + (28-27)^2 + (15-27)^2}{27} = \frac{318}{27}$

= 11.78

d. for df = 3, $\chi^2_{.01}$ = 11.34

e. Reject (barely made it!)
f. Type I
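
The equal-frequencies chi-square for the book data can be recomputed as follows. This is a sketch with an illustrative function name; the observed counts are the ones in the answer above, and the expected count is simply the total divided by the number of categories.

```python
def chi_square(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

observed = [40, 25, 28, 15]          # Novels, Nonfiction, Children, Other
k = len(observed)
expected = [sum(observed) / k] * k   # 108 / 4 = 27 in every category

chi2 = chi_square(observed, expected)
print(expected[0], round(chi2, 2), k - 1)  # 27.0 11.78 3
```

With df = k - 1 = 3, the computed 11.78 just clears the tabled 11.34, hence "barely made it."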






2. a.         Cauc.   Black   Chic.   Other   Total

Observed       35      41      20       4      100
Expected       40      40      15       5      100

b. $\chi^2$ = 25/40 + 1/40 + 25/15 + 1/5

= 2.52

Crit. = 7.82 (from Table III)
Retain the null.



3.            Cauc.   Black   Chic.   Other   Total

Observed       35      41      20       4      100
Expected       25      25      25      25      100

$\chi^2 = \frac{100 + 256 + 25 + 441}{25} = \frac{822}{25}$

= 32.88
Reject the null.
4. If df = 1 in a one-way test, Yates' correction is needed.



10.3 POSTTEST 



1. a. E's (row total × column total / 200):

21    21    18
49    49    42

$\chi^2 = \frac{(10-21)^2}{21} + \frac{(20-21)^2}{21} + \frac{(30-18)^2}{18} + \frac{(60-49)^2}{49} + \frac{(50-49)^2}{49} + \frac{(30-42)^2}{42}$

= 5.76 + .05 + 8 + 2.47 + .02 + 3.43
= 19.73

b. df = (R-1) (C-1)

= (2) (1)

= 2

c. tabled $\chi^2_{.01}$ = 9.21

d. reject the null

2. a. E's = 8.75     21     40.25

16.25     39     74.75

$\chi^2 = \frac{(10-8.75)^2}{8.75} + \frac{(20-21)^2}{21} + \frac{(40-40.25)^2}{40.25} + \frac{(15-16.25)^2}{16.25} + \frac{(40-39)^2}{39} + \frac{(75-74.75)^2}{74.75}$

= .18 + .05 + 0.00 + .10 + .03 + 0.00

= .35 or .36

c. Retain the null
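
For the two-way test, the expected frequencies come from the row and column totals. The sketch below rebuilds them from the observed counts that appear inside the chi-square terms above; the function names and the row-by-column layout of the tables are assumptions made for the illustration.

```python
def expected_counts(table):
    """E for each cell = row total * column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

def chi_square_two_way(table):
    """Chi-square and df for a table of observed frequencies."""
    expected = expected_counts(table)
    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

first = [[10, 20, 30], [60, 50, 30]]
second = [[10, 20, 40], [15, 40, 75]]

print(expected_counts(first))    # [[21.0, 21.0, 18.0], [49.0, 49.0, 42.0]]
print(chi_square_two_way(first))
print(chi_square_two_way(second))
```

The first table gives a chi-square near 19.73 with df = 2, well past the tabled 9.21. The second gives about .35, so the null is retained.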

Unit X Review Test

1. A one-way test looks at results on one variable only, testing for
equal frequencies or goodness-of-fit.

Example: Are equal numbers of automobiles presently using regular,
unleaded, and diesel gasoline?

A two-way test looks at the relationship (or independence) of two
variables.

Example: Is the type of gasoline used (regular, unleaded, or
diesel) related to the number of doors in the vehicle (two or
four)?

2. The equal frequencies test examines whether the same number of f's fall
into each of the other categories. The goodness-of-fit instead
examines whether the observed f's for the categories conform to some
hypothesized or known distribution.

Formula: the same for both

O's: Whatever is observed in the particular study. (O's are
determined in the same way for both.)

E's: Differ depending on the design. In the equal f's case,
E = Total Obs. / k

All E's will be the same.



In the goodness-of-fit test, E's will be the number (proportion)
of f's hypothesized for each category. E's will differ across
categories.



3. a. Equal f's is appropriate, since he is trying to determine whether
f's are similar at different class levels.

b. $\chi^2 = \frac{(20-25)^2}{25} + \frac{(30-25)^2}{25} + \frac{(25-25)^2}{25} + \frac{(25-25)^2}{25}$

= 1 + 1 + 0 + 0

= 2

d. Retain the null

4. a. $\chi^2 = \frac{(30-15)^2}{15} + \frac{(10-30)^2}{30} + \frac{(30-20)^2}{20} + \frac{(30-35)^2}{35}$

= 15 + 13.33 + 5 + .71

= 34.04

c. Reject the null

d. The distribution of majors attending games does not conform to the
distribution of all majors in the university. Arts and Sciences
and Education majors are over-represented at games; Business and
"Other" majors are under-represented.

5. a. E's: (50)(40)/120 = 16.67    (50)(45)/120 = 18.75    (50)(35)/120 = 14.58

$\chi^2 = \frac{(30-23.33)^2}{23.33} + \frac{(15-26.25)^2}{26.25} + \frac{(25-20.42)^2}{20.42} + \frac{(10-16.67)^2}{16.67} + \frac{(30-18.75)^2}{18.75} + \frac{(10-14.58)^2}{14.58}$

= 1.91 + 4.82 + 1.03 + 2.67 + 6.75 + 1.44

= 18.62

Reject the null

b. Political preference and newspaper preference appear to be related.
Republicans and "Others" seem more inclined to pick the Press
Minotaur. Democrats seem more inclined to pick the Daily Planet.



UNIT XI 



11.1 POSTTEST



1. The t test holds the Type I error rate at $\alpha$ for each comparison.
If there are more than two treatments, more than one t test will
be needed to analyze each unique pairing (e.g., 1 vs. 2, 1 vs. 3,
2 vs. 3). The chances of making a Type I error in at least one
of those analyses exceed $\alpha$ since each separate test carries that
amount of risk by itself.
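
A rough feel for how fast the risk grows can be had from the sketch below. It assumes the separate t tests are independent of one another, which is not strictly true when they share groups, so the figures are approximations rather than exact rates. The function name is illustrative.

```python
from math import comb

def overall_type_i_rate(alpha, treatments):
    """Number of unique pairings and the approximate chance of at least
    one Type I error, treating the tests as independent."""
    comparisons = comb(treatments, 2)
    return comparisons, 1 - (1 - alpha) ** comparisons

for k in (2, 3, 4, 5):
    pairs, rate = overall_type_i_rate(0.05, k)
    print(k, pairs, round(rate, 3))
```

With three treatments there are three pairings and the overall rate is already about .14, nearly three times the .05 set for each test.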

2. As more treatments are added to the design, the Type I error rate
will increase. More treatments mean more chances of selecting an
unusual sample, and concluding that significant differences exist
when the null (no differences) is actually true. ANOVA avoids
this problem by holding the overall Type I error rate (regardless
of the number of treatments) at $\alpha$.

3. Hq : aj ti2 " Ui ° 

i 

4. If the null is retained, one concludes that there are no
differences in the populations represented by the sample means.

If the null is rejected, it means that there is at least one
difference between population means. Follow-up tests are then
performed to find where that difference lies.






11.2 POSTTEST 



1. The F ratio expresses the extent of differences between groups (or
treatments) relative to the extent of differences between individuals
within those groups.

If the null hypothesis is false, the expectancy is for "significant"
differences to occur between groups, and the F value to reach a
significantly high level.

2. a. Random error accounts for the denominator.

b. Random error only would account for the numerator.

c. Random error and a treatment effect would account for the
numerator.

d. If the null were false, E(F) > 1.

The reason can be easily demonstrated mathematically. If the
null is false, there should be a treatment effect. If so:
random error + treatment effect > random error only. The
numerator will thus tend to be larger than the denominator,
producing an F value > 1.

3. If the null is rejected we would conclude that there is at least
one difference between population means (check back to Unit 11.1
for review).

Another way of saying this is that there is a significant treatment
effect.

4. a. a - 1 = 3

b. a(n - 1) = 4(9) = 36

c. an - 1 = 40 - 1
= 39



5. Source      SS     df     MS      F

Between       200      4     50    2.50
Within        800     40     20
Total        1000     44
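
Filling in an ANOVA summary table is mostly division. The sketch below shows the steps with the SS and df values from the table above; the function name and the dictionary keys are invented for the illustration.

```python
def anova_table(ss_between, df_between, ss_within, df_within):
    """Complete a one-way ANOVA summary table from SS and df."""
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "MS between": ms_between,
        "MS within": ms_within,
        "F": ms_between / ms_within,
        "SS total": ss_between + ss_within,
        "df total": df_between + df_within,
    }

table = anova_table(200, 4, 800, 40)
print(table)
```

Each MS is its SS divided by its df, and F is the between MS over the within MS: 50 / 20 = 2.50.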










11.3 POSTTEST

The analysis indicated that there was a significant treatment effect.
Looking at the means we can get a general impression of the nature of
that effect. The Sports treatment produced the highest attitudes,
the Abstract treatment produced the lowest and the Education treatment
fell in the middle. Follow-up tests could be used to make specific
comparisons between the three means.

Note: One type of follow-up test would be to make all unique
comparisons using the t test for independent samples. Since we
protected the overall Type I error rate by using F, there is less concern
about using t than would have been the case otherwise. However, other
follow-up tests exist and these should be explored before selecting t
as the preferred method.

Unit XI Review Test

1. As additional comparisons between pairs of means are made using
t tests, the overall Type I error rate increases. By "overall"
is meant the chances of at least one Type I error occurring across
all comparisons.

2 . a ^ 

4. 
5. 
6. 
7. 
8. 



a 








• 




c 

a ^ \ - 












C 1 










■ y 


c 


• * 










Source       SS     df     MS       F

Screen       60      3     20      2.31
Within      590     68     8.68
Total       650     71









The decision would be to retain the null. The computed F is
smaller than the tabled F. There is insufficient basis for
inferring that the population means differ from one another;
thus there is insufficient basis for concluding that any treatment
is more effective than any other.






Appendix 



Table I: Normal Curve Area

Table II: Significance Levels of the t Ratio
Table III: Significance Levels of the Chi-Square






Table I

Normal Curve Area

 z     .00    .01    .02    .03    .04    .05    .06    .07    .08    .09

0.0   .0000  .0040  .0080  .0120  .0160  .0199  .0239  .0279  .0319  .0359
0.1   .0398  .0438  .0478  .0517  .0557  .0596  .0636  .0675  .0714  .0753
0.2   .0793  .0832  .0871  .0910  .0948  .0987  .1026  .1064  .1103  .1141
0.3   .1179  .1217  .1255  .1293  .1331  .1368  .1406  .1443  .1480  .1517
0.4   .1554  .1591  .1628  .1664  .1700  .1736  .1772  .1808  .1844  .1879
0.5   .1915  .1950  .1985  .2019  .2054  .2088  .2123  .2157  .2190  .2224

0.6   .2257  .2291  .2324  .2357  .2389  .2422  .2454  .2486  .2517  .2549
0.7   .2580  .2611  .2642  .2673  .2704  .2734  .2764  .2794  .2823  .2852
0.8   .2881  .2910  .2939  .2967  .2995  .3023  .3051  .3078  .3106  .3133
0.9   .3159  .3186  .3212  .3238  .3264  .3289  .3315  .3340  .3365  .3389
1.0   .3413  .3438  .3461  .3485  .3508  .3531  .3554  .3577  .3599  .3621

1.1   .3643  .3665  .3686  .3708  .3729  .3749  .3770  .3790  .3810  .3830
1.2   .3849  .3869  .3888  .3907  .3925  .3944  .3962  .3980  .3997  .4015
1.3   .4032  .4049  .4066  .4082  .4099  .4115  .4131  .4147  .4162  .4177
1.4   .4192  .4207  .4222  .4236  .4251  .4265  .4279  .4292  .4306  .4319
1.5   .4332  .4345  .4357  .4370  .4382  .4394  .4406  .4418  .4429  .4441

1.6   .4452  .4463  .4474  .4484  .4495  .4505  .4515  .4525  .4535  .4545
1.7   .4554  .4564  .4573  .4582  .4591  .4599  .4608  .4616  .4625  .4633
1.8   .4641  .4649  .4656  .4664  .4671  .4678  .4686  .4693  .4699  .4706
1.9   .4713  .4719  .4726  .4732  .4738  .4744  .4750  .4756  .4761  .4767
2.0   .4772  .4778  .4783  .4788  .4793  .4798  .4803  .4808  .4812  .4817

2.1   .4821  .4826  .4830  .4834  .4838  .4842  .4846  .4850  .4854  .4857
2.2   .4861  .4864  .4868  .4871  .4875  .4878  .4881  .4884  .4887  .4890
2.3   .4893  .4896  .4898  .4901  .4904  .4906  .4909  .4911  .4913  .4916
2.4   .4918  .4920  .4922  .4925  .4927  .4929  .4931  .4932  .4934  .4936
2.5   .4938  .4940  .4941  .4943  .4945  .4946  .4948  .4949  .4951  .4952

2.6   .4953  .4955  .4956  .4957  .4959  .4960  .4961  .4962  .4963  .4964
2.7   .4965  .4966  .4967  .4968  .4969  .4970  .4971  .4972  .4973  .4974
2.8   .4974  .4975  .4976  .4977  .4977  .4978  .4979  .4979  .4980  .4981
2.9   .4981  .4982  .4982  .4983  .4984  .4984  .4985  .4985  .4986  .4986
3.0   .4987  .4987  .4987  .4988  .4988  .4989  .4989  .4989  .4990  .4990

3.1   .49903
3.2   .49931
3.3   .49952
3.4   .49966
3.5   .49977
3.6   .49984
3.7   .49989
3.8   .49993
3.9   .49995
4.0   .50000






Table II

Significance Levels of the t Ratio

                 Level of Significance

 df      .10      .05      .02      .01     .001

  1     6.314   12.706   31.821   63.657  636.619
  2     2.920    4.303    6.965    9.925   31.598
  3     2.353    3.182    4.541    5.841   12.941
  4     2.132    2.776    3.747    4.604    8.610
  5     2.015    2.571    3.365    4.032    6.859

  6     1.943    2.447    3.143    3.707    5.959
  7     1.895    2.365    2.998    3.499    5.405
  8     1.860    2.306    2.896    3.355    5.041
  9     1.833    2.262    2.821    3.250    4.781
 10     1.812    2.228    2.764    3.169    4.587

 11     1.796    2.201    2.718    3.106    4.437
 12     1.782    2.179    2.681    3.055    4.318
 13     1.771    2.160    2.650    3.012    4.221
 14     1.761    2.145    2.624    2.977    4.140
 15     1.753    2.131    2.602    2.947    4.073

 16     1.746    2.120    2.583    2.921    4.015
 17     1.740    2.110    2.567    2.898    3.965
 18     1.734    2.101    2.552    2.878    3.922
 19     1.729    2.093    2.539    2.861    3.883
 20     1.725    2.086    2.528    2.845    3.850

 21     1.721    2.080    2.518    2.831    3.819
 22     1.717    2.074    2.508    2.819    3.792
 23     1.714    2.069    2.500    2.807    3.767
 24     1.711    2.064    2.492    2.797    3.745
 25     1.708    2.060    2.485    2.787    3.725

 26     1.706    2.056    2.479    2.779    3.707
 27     1.703    2.052    2.473    2.771    3.690
 28     1.701    2.048    2.467    2.763    3.674
 29     1.699    2.045    2.462    2.756    3.659
 30     1.697    2.042    2.457    2.750    3.646

 40     1.684    2.021    2.423    2.704    3.551
 60     1.671    2.000    2.390    2.660    3.460
120     1.658    1.980    2.358    2.617    3.373
  ∞     1.645    1.960    2.326    2.576    3.291



Table II is abridged from Table IV of Fisher & Yates, Statistical Tables
for Biological, Agricultural and Medical Research, published by Longman
Group Ltd., London (previously published by Oliver and Boyd, Ltd.,
Edinburgh), and by permission of the authors and publishers.
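To show how a table of this kind is typically read, here is a minimal Python sketch of a critical-value lookup. It copies only a handful of rows from the .05 column of Table II. The names `T_05`, `critical_t` and `is_significant` are hypothetical. The rule of falling back to the nearest smaller tabled df is a common conservative convention that we assume here; this appendix does not state it.

```python
import math

# A few rows copied from the .05 column of Table II (df -> critical t).
T_05 = {10: 2.228, 20: 2.086, 30: 2.042, 40: 2.021,
        60: 2.000, 120: 1.980, math.inf: 1.960}

def critical_t(df, table=T_05):
    """Return the tabled critical t for the largest tabled df not exceeding df.

    Assumption: when the exact df is not tabled, use the next smaller row,
    which gives a slightly larger (more conservative) critical value.
    """
    usable = [d for d in table if d <= df]
    if not usable:
        raise ValueError("df is smaller than every row in this abridged table")
    return table[max(usable)]

def is_significant(t_obtained, df, table=T_05):
    """Compare the absolute obtained t with the tabled critical value."""
    return abs(t_obtained) >= critical_t(df, table)

print(critical_t(45))             # falls back to the df = 40 row
print(is_significant(-2.5, 25))   # uses the df = 20 row
```

With df = 45 the sketch uses the df = 40 row (2.021), and an obtained t of -2.5 with df = 25 is compared against 2.086.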




Table III

Significance Levels of the Chi-Square Test

             Level of Significance

 df       .01       .05       .10

  1      6.64      3.84      2.71
  2      9.21      5.99      4.60
  3     11.34      7.82      6.25
  4     13.28      9.49      7.78
  5     15.09     11.07      9.24

  6     16.81     12.59     10.64
  7     18.48     14.07     12.02
  8     20.09     15.51     13.36
  9     21.67     16.92     14.68
 10     23.21     18.31     15.99

 11     24.72     19.68     17.28
 12     26.22     21.03     18.55
 13     27.69     22.36     19.81
 14     29.14     23.68     21.06
 15     30.58     25.00     22.31

 16     32.00     26.30     23.54
 17     33.41     27.59     24.77
 18     34.80     28.87     25.99
 19     36.19     30.14     27.20
 20     37.57     31.41     28.41

 21     38.93     32.67     29.62
 22     40.29     33.92     30.81
 23     41.64     35.17     32.01
 24     42.98     36.42     33.20
 25     44.31     37.65     34.38

 26     45.64     38.88     35.56
 27     46.96     40.11     36.74
 28     48.28     41.34     37.92
 29     49.59     42.56     39.09
 30     50.89     43.77     40.26



Table III is abridged from Table IV of Fisher & Yates, Statistical Tables
for Biological, Agricultural and Medical Research, published by Longman
Group Ltd., London (previously published by Oliver and Boyd, Ltd.,
Edinburgh), and by permission of the authors and publishers.
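As an illustration of how Table III might be used, here is a minimal Python sketch of a one-way chi-square test with equal expected frequencies. It relies on the standard formula, the sum of (observed - expected)² / expected, with df equal to the number of categories minus one. The observed counts are made up for the example, and the names `CHI2_05` and `chi_square_equal` are hypothetical.

```python
# A few rows copied from the .05 column of Table III (df -> critical value).
CHI2_05 = {2: 5.99, 3: 7.82, 4: 9.49, 5: 11.07}

def chi_square_equal(observed):
    """One-way chi-square statistic when every category has the same expected count."""
    expected = sum(observed) / len(observed)
    return sum((o - expected) ** 2 / expected for o in observed)

# Hypothetical data: 60 cases spread over three categories (expected = 20 each).
observed = [20, 30, 10]
chi2 = chi_square_equal(observed)
df = len(observed) - 1            # categories minus one

print(f"chi-square = {chi2:.2f}, df = {df}")
print("significant at .05:", chi2 >= CHI2_05[df])
```

For these made-up counts the statistic works out to 10.00 with df = 2, which exceeds the tabled 5.99.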







Index 



Absolute value, 160
Adding numbers, 2
Alternative hypotheses, 229-231
Analysis of variance, 314-329
    computation, 317-320
    rationale, 314-316
    summary tables, 321-322, 324-325



Basic Math, review, 1-15
Bell-shaped distribution, 35-37
Between groups sum of squares, 319-321
Biased statistics, 203
Bimodal, 36
Bobolinckskismith, Natasha, 131



Categorical measure, 300
Causation, 146
Central Tendency, 40, 58-74
Chi-square test, 299-310
    one-way test, equal frequencies, 301-303, 310
        for goodness-of-fit, 303-305, 310
    two-way test, 307-310
    Yates' correction, 305
Continuous measures, 17
Correlation, coefficient, 159-162
    computation, 164-172
    definition, 138
    determining strength, 156-157, 160
    in samples, 207
    plotting, 147-156
    steps in applying, 142-150
Cumulative frequency, 22-34
    graphs, 33-34



Degrees of freedom, in ANOVA, 319-321
    in chi-square, 302, 305, 309
    in t test, 264, 273, 278, 287
Dependent trials, in probability, 185-188
Descriptive statistics, 217
Discrete measures, 17
Dividing numbers, 4-5



Expected Value, of Sampling Distributions, 211
    of F distributions, 318
Exponents, 12-13



Form of distribution, 199. See frequency distributions.
Fractions, 7-8
Frequency distributions, forms, 35-38
    grouped, 25-28
    open-ended, 69-70
    ungrouped, 22-24
    versus Sampling Distributions, 201
Frequency polygon, 29-31



Group effect in ANOVA. See treatment effect.



Histograms, 32
Hypothesis testing, probability level, 244-254
    rationale, 227-229
    Type I and II errors, 233-244
    versus confidence intervals, 289



Independent trials, in probability, 184-185
Inferential statistics, 217
Interpolation, 47-57, 64
Interval estimation, 289
Interval scores, 299
IQ scores, 107






Lower limits. See real limits. 



Mean, computation, 59-62
    in skewed and nonskewed distributions, 69-74
    of samples, 206
    Sampling Distribution of, 209-224
Mean Squares, 321-322
Median, computation, 63-66
    in samples, 207
    in skewed and nonskewed distributions, 69-74
Midpoints, 25, 26, 28, 30-32, 61-62
Mode, determination of, 68
    in frequency distributions, 36-37
    in skewed and nonskewed distributions, 69
Multiplying numbers, 3-4
Mutually exclusive, 179



Negative correlation, definition, 141
    imperfect negative, 161
    perfect negative, 161
Nominal scores, 299-300
Nonmutually exclusive, 179
Normal curve, description, 109-113
    area between raw scores, 126-128
    area between z scores, 113-125
    percentile ranks, 129-136
Null hypothesis, 229-231
    in ANOVA, 315
    in chi-square, 301
    in comparing means, 271
    in testing correlations, 286



Order of operations, 6-7 
Ordinal scores, 299 



Parameters, defined, 198
Pearson. See correlation coefficient.
Percentile ranks, definition, 40-41
    computing, 42-52
    converting to raw scores, 53-57
    on normal curves, 129-136
Point estimation, 289
Positive correlation, definition, 141
    imperfect positive, 161
    perfect positive, 160
Probability, addition rules, 179-182
    defined, 177-178
    multiplication rules, 183-188
Probability level, in hypothesis testing, described, 244-245
    relation to Type I and II errors, 251-252



Random error, 317-318
Random sampling, 143, 192-196
Range, computation, 78-79
    of samples, 206
Ranked data, 170
Ratio scores, 299
Raw score formula for correlation, 168-170
Real limits of scores, 16-20, 44-46
Rectangular distribution, 35, 37
Relative frequency, 177
Roots, 12-13



Sample space, 177
Sampling Distributions, defined, 200-202
    in constructing confidence intervals, 290-291
    in hypothesis testing, 236-240
    of means, 209-224
        central tendency, 211
        computing probabilities, 218-224
        form, 210
        variability, 212
Sampling fluctuation, 201, 203
Scatterplot, constructing, 147-153
    interpreting, 153-157
    slope, 156-157






Significance level. See probability level.
Skewness, 35-37
Solving for unknowns, 8-9
Spearman correlation, 170-172
Standard deviation, computation, 89-92
    definition, 89
    in normal distributions, 91, 113-115
    in samples, 92
Standard Error, of Sampling Distributions, 213
    computation, 215-216
Statistics, defined, 198
Subtracting numbers, 2-3
Sum of squares, 319-321
Symmetrical distribution, 35, 37



Whale, Bubbles, the, 17-18 

Within groups sum of squares, 319-321 



Yates' correction, 305



Z score, computation, 96
    converting to raw scores, 98-99
    converting to T scores, 103-105
    definition, 95
    properties, 100-103
    on normal curve, 116-134
z score formula for correlation, 164-168
z test, 256-261
zero correlation, 161






t statistic, in confidence intervals, 282-293
    See t test.
t test, for correlation, 286-288
    for means, parameter comparison,
        using dependent samples, 278-283
        using independent samples, 271-277
    versus ANOVA, 315-316
T scores, 103-106
Total sum of squares, 319-321
Treatment effect in ANOVA, 318
Triangular distribution, 35
Type I error, 240-244
Type II error, 240-244



Unbiased statistics, 203 

Upper limits. See real limits. 



Variability, definition, 77-78
Variable, in correlation, 139
Variance, computation, 80-88
    definition, 80
    in samples, 85, 207






